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HISTOLOGICAL EXAMINATION OF A CASE 
OF ALBINISM. 


By J. E. ADLER, F.R.C.S. anp J. M°INTOSH, M.D. 
(From the Bacteriolegical Laboratory, London Hospital, E.) 


ALBINISM has now long been a subject of interest to medical science, and many 
papers have appeared from time to time descriptive of its clinical peculiarities. 
The literature on the histological findings of the affected parts is, however, most 
scanty, and is usually confined to descriptions of pieces of the iris removed at 
operations for traumatic and other affections. 


In looking through the literature only one record of a complete post-mortem 
examination on an albino could be found. This was made by Buzzi in 1783 (2). 


The case from his description appears to have been a complete albino, but his 
descriptions of the minute anatomy are far from satisfactory. Carron du Villards (1) 
and Maury (8) record dissections of human albino eyes; the former with absence of 
all pigment, the latter with traces of pigment. 

It is a pity that a more uniform system of classification has not been adopted 
by the various observers in the description of their cases, so that it might have 
been possible to get a correct idea of the condition observed. 

All the cases on which histological observations have been made, with the 
exception of Buzzi’s case, would come under the groups of incomplete albinism, 
according to the classification proposed by Geoffroy Saint Hilaire (3). 

In the case of partial albinism recorded by Mueller (9), there was a complete 
absence of all pigment in certain areas of the iris. In Nettleship’s case there was 
a complete absence of pigment in the portion of the iris examined which was 
probably a fair sample. On the other hand, in Usher's cases of incomplete albinism 
recorded by Nettleship (10), there was diminished pigment in both choroid and 
retina. Clinically Usher’s cases might pass for complete albinos, and this would 
mean that we could not distinguish by clinical methods between cases of incomplete 
and complete albinism. 

The case with which we are about to deal, and which we had the opportunity 
of examining, was that of a female child of 10 weeks. The child was the second 
albino in the family, and the mother had taken the matter so much to heart that 
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she committed suicide. She left the child exposed, and from the effects of this 
it died. 


Family History. The family history of the child was specially investigated in 
order to ascertain whether there was a hereditary history of albinism; but no 
evidence of albinism could be found in any of the ancestors or relations in the four 
previous generations. The description of the family tree (see p. 243) connected 
with the case is as follows (unless marked as albino, all members are normal) :— 

Generation IV. 12. B. C., aet. 10, albino—pink eyes, nystagmus, white hair, big healthy 
boy, very bright. 13. L. C., aet. 8, ¢ healthy. 14. —.C., aet.6, 9 healthy. 15. N., aet. 4, ¢. 
16. O. C., aet. 3, Q healthy. 17. C. C, aet. 6 weeks, 2 albino, the case described. 
IV. 1. S. W., aet. 25, ¢ healthy. 2. L. W., aet. 24, ¢ healthy. 3. E. W., aet. 22, 2 healthy- 
4. B. W., act. 22, 9. 5. P. K, aet.?, dg. 6. A. K., aet.2, 9. 7 G BD. ae. 13, ¢&. 
& D. Be, at. 6, ¢. 0 J. Bat. o Gg. 10. &. EB, act. 10, 9. 11... MB, ack 11, 9. 
18. B.S, aet.9,g. 19. D.S, aet. 7}, 9. 


Generation III. 1. R. W., father of family IV. 1—4. 2. M. A. W., wife of III. 1, healthy 
(eldest sister). 3. A. K., sister of IIL.,2. 4. P. K., ber husband, father of family IV. 5—6. 
5. L. C., mother of IV. 7, and wife of III. 6. 6. G.D., husband. 7. L.C., 9, died. 8. E.C., 


Z. 
9,died. 9. D.C. g, married to III. 10. 10. 8S. E., 9, became wife of II]. 9, mother of IV. 
8—ll. 11. A.C, g, died. 


12, A. C., g, youngest child, father of case, married IIT. 27 (L. R.), 
alive. 13—17, Children of W. C., II. 3—4, all unaffected, but all details unknown. 18, 19, 20, 
and 21, Children of A. C., II. 5—6, all unaffected, details unknown. 22. A. R., 9, spinster, 
pale, but not a true albino. 23. F.R., 2. 24. G.R. g,inasylum. 25. H.R. ¢, married 
D.S., 9, III 26. 9, wife of IV. 18,19. 27. 9, L. R., married A. C. III. 11, died 
examined, found unaffected, mother of the two albinos in the family. 28. ¢, S. R., just 
married to III. 29. 2. 30. 9, P. R., just married to III. 31. ¢, B. 32. If any, and 
how many, unknown. 33—34. Offspring of II. 11 and 12, age and sex unknown, known however 
to be normal. 35. If any, no details known. 36—45. Offspring of II. 15—16, all unaffected, 
but no details as to order and sex. 

Generation II. 1. D.C. alive, ¢, aet. 74, unaffected. 2. M. A. S., 2, wife of above, 
unaffected, parents of III]. 1—12. 3. W. C., ¢, died, age unknown, unaffected. 4. 9, his 
wife, unaffected. 5. A. C., ¢, died, age unknown, unaffected. 6. 9, his wife, unaffected. 
7. A.C. 9, married G. R. (8), ¢, parents of family III. 22-30. 9. S. C., 9, married 
10. ¢, age and initials unknown. 11. L. C., 9, married 12, ¢, age and initials unknown. 
13. A. C., 9, married 14, ¢, age and initials unknown. 15. M. C., 9, married 16, ¢, age 
and initials unknown; II. 15—16 parents of III. 36--45. 17. 9, sister of G. R., IL. 8, 
unaffected. 18. 9, sister of G. R., Il. 8, unaffected. 19. 9, sister of G. R., IL. 8, unaffected. 
Generation I. 1. D. C., ¢, died, aet. 72, unaffected. 2. 9, wife of above, unaffected, all 
other details unknown. 3. R., ¢, unaffected, all details unknown. 4. 9, unaffected. 


Pathological Anatomy—Post-mortem Notes. 
External Appearances. The body was that of a well nourished female child of 
about six weeks. Rigor mortis was absent. 
There were small papular erythematous patches on the extensor aspects of 
both wrists, forearms and legs; symmetrically arranged ecchymoses were seen on 
the upper and inner aspects of both thighs. 
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The back, shoulders, and sides were covered with fine white downy hairs. The 
nipple, umbilicus, perineum, and anus were unpigmented. The scalp was covered 
with a large quantity of fine silky hair about an inch long. No hairs were pig- 


mented. The eyebrows consisted of white hairs, while the eyelashes were regular, 
even, and white. 


The eyes were pink ; the pupils equal, and moderately small. The irides were 
entirely devoid of all pigment, and were pinkish in colour, with fine greyish white 
radially arranged striations. 

Weight, 7 lbs. 14 ozs. Circumference of head, 144 ins. 


On opening the thorax, the thymus gland was found to be enlarged, and 
reached down as far as the level of the third costal cartilage; weight, $ oz. 
There was no excess of fluid in either pleural cavity, and there were no pleuritic 
adhesions. The lungs were collapsed, definite capillary bronchitis being present. 
The mucous membrane of the bronchi was congested, and the larger bronchi 
contained some yellowish pus. There was no consolidation. Weight, right lung, 
1} ozs.; left lung, 13 ozs. 


The Heart. There was no excess of fluid in the pericardium. The foramen 
ovale was not completely closed, and admitted a fine probe. The valves and aorta 
were normal in appearance. Weight, 3 oz. 


The Stomach. The stomach was almost empty, and very pale in colour; two 
or three ecchymotic spots were noticed on its anterior surface near the greater 
curvature. The stomach contained about two or three drachms of mucous fluid, 
which was odourless. The mucous membrane appeared healthy. 


The Liver protruded one finger’s breadth below the costal margin, and, on 
being cut into, appeared to be normal. No thickening of the capsule was noticed. 
The gall bladder was filled with the usual yellow bile, and the ducts were found 
patent. 


The Spleen (4 oz. in weight) was firm in texture. On section, the malpighian 
bodies were not distinctly marked. 


The Kidneys (4 oz. each in weight) were lobulated. Their capsule stripped 
readily, and the perirenal fat was plentiful and normal in colour. The ureters 
were patent, and were followed into the bladder. The latter was empty, and was 
healthy in appearance. Both suprarenal glands were present, and appeared 
normal, 


The small intestine was, for the most part, thin and empty, and the mucous 
membrane, though pale in colour throughout, appeared to be healthy; while the 
Peyer’s patches presented a normal appearance. The large intestine, appendix, 
and rectum were examined and found to be healthy. The pancreas appeared to 
be normal, both in regard to structure and weight; the duct of Wirsung was 
patent. The mesenteric, bronchial, cervical, and axillary glands were normal in 
size, and presented the usual appearance on section. The tongue, tonsils, and 
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vesophagus were normal. The uterus and cervix were infantile in shape, but the 
ovaries were both enlarged and cystic. Pieces were removed from each organ, and 
placed in a solution of formalin (1 in 10) for histological examination. 


The Brain was removed intact (weight, 1 lb. $ 0z.), and was very soft. The 
dura mater, as is usual in children, was adherent at the suture lines. The cerebral 
sinuses were found to be normal. The pituitary body with the sella turcica 
appeared to be normal, and were removed en masse. 


Histopathology. All the organs and tissues were examined microscopically, 
but, apart from a complete absence of any pigment in certain parts, no histological 
peculiarities were observed. With regard to the liver, spleen, lungs, kidneys, 
pancreas, thymus, thyroid, heart, and muscles, no abnormality was made out; 
while in the other structures, namely, brain, eyes, internal ear, suprarenal gland, 
skin, and hair, the only noticeable feature was a complete absence of pigment. 


Brain. Sections of the cerebrum, cerebellum, crura cerebri, pons Varolii, and 
medulla oblongata were examined, both in the stained and unstained condition. 
Search was made for pigment in those parts in which pigmentation normally 
exists in the brain cells in the adult. The various nuclei of the pons, medulla, and 
cerebellum, the substantia nigra and the grey matter were examined, but contained 
no pigment granules. 


Sections of these areas were also stained with fluorescein, but no granules were 


found. 
The pituitary body was apparently normal in structure. 


The Eyes. Both eyes were examined completely, one was fixed in formalin 
(10°/,) and the other in spirit, and then they were frozen solid and cut into 
right and left halves. The internal aspect of the eyeballs was like yellowish white 
parchment. The vessels were clearly outlined on their surface, while the fovea 
centralis appeared as a distinct spot of a slightly deeper tint than the surrounding 
retina. 


Paraffin and celloidin sections were made and examined under the microscope 
by day and artificial light, without even a trace of pigment being found in any of 
the structures of the eye, whether of an ectodermic or mesodermic origin, retina, 
epithelium and stroma of iris or choroid : see Plate II. 


Owing to the entire absence of pigment in the eyes, the anatomical structure 
of the irides and retina was shown to great advantage, while the arrangement of 
the delicate connective tissue of the iris and its continuity with the choroid and 
ciliary processes were clearly seen. The arrangement of the two sets of muscle 
fibres could be followed. The iris also showed a well-marked crypt. Unfortunately 
the retina had not been fixed soon enough after death to preserve the rods and 
cones layer intact, their situation however can be easily recognised. 


The choroid and retina were examined in the stained and unstained condition. 
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Internal Ear. The petrous portion of the temporal bone containing the 
middle and internal auditory apparatus was removed entire and decalcified. The 


internal ear contained no pigment in the mesoblastic cells of the perilymphatic 
spaces. 


The Suprarenal Gland. The medulla and the cortex appeared to be of the 
usual proportions; no brown granules were to be observed in those cells of the 
cortex which are usually found to contain pigment in adult life. 


The Skin. Except for the absence of pigment, the only noticeable feature seen 
in sections of the skin was the non-granular appearance of the rete mucosum. 


The Hair contained no pigment; after the hairs were cleared in xylol they 
became quite transparent, and could be seen only with considerable difficulty. 


In conclusion, one must say that the case described above is one of complete 
albinism, on account of the complete absence of any melanin pigment in those 
structures which normally contain it. 


This child, though it might also be described in Professor Karl Pearson’s 
terminology as an internal albino, does not solve the question as to whether 
internal albinism exists in adult albinos, because practically no pigment exists in 
any part of the brain of a child before the sixth month of life (Marinesco (7)). 
The same is also true for those glands, such as the suprarenal, which contain 
pigment in adult life (Koelliker (5)). 


There was no evidence of any maldevelopment or mental feebleness in any 
member of the family, such as has been reported in certain albinotic stocks. But 
though there is undoubted proof that in certain instances deficiency of pigment 
does go along with such conditions, we have no reason to suppose that there is a 
constant relation between them. 


In all probability this child, had it lived, would have remained a complete 
albino even in spite of the fact that the colour of the iris in all infants darkens 
considerably during the first year of life, because it is very unlikely that pigmenta- 
tion would occur after so long a period of complete absence, namely, over 10 months, 
as melanin pigment, according to Koelliker (4), makes its appearance in the retina 
towards the fourth week of intra-uterine life. The specimens kindly lent by 
Dr Low (6) showed distinct pigment in the retina at this period: see Plate I. 


Where the albinism entered this family is not at all clear; if the condition is 
not a sport it is most likely that the condition was handed down from the 
(paternal) grand-parents, as the parents of the child were cousins. From a 
Mendelian view both parents might be described as carrying recessive albinism, 
but we have really no evidence from this single case to support this view. 
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DESCRIPTION OF PLATES. 


Puate I. Fic.1. Section of Eye in 4th week embryo. " 
Fic. 2. Section of Eye in 10th week foetus. 


We owe these sections to the kindness of Dr A. Low. 


PruatelI. Fic. 3. Section of retina of albino case. 
Fic. 4. Section of retina of normal infant. 


Fic. 5. Section of iris of albino case. 


Fic. 6. Section of iris from normal infant. 
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Fic. 1. Section of eye in 4th week embryo (10 mm, long) 
(a) 


Pigment granules in pigment layer of retina. 
(b) Lens. 


[Dr Low’s section] 
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Fria, 2. Section of eye, 10th week foetus (28 mm. long) 
(a) Pigment layer of retina 


[Dr Low’s section] 
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NOTE ON INTERNAL ALBINISM. 


By KARL PEARSON, F.R.S. 


THE authors of the previous paper having referred to my use of the term 
internal albinism, it may not, perhaps, be out of place to say a few words on that 
subject. This is not done because any conclusions on this point have been so 
far worked out, but because the investigations upon which they must turn will 
perforce have to be made by others with not only the adequate training but the 
requisite material,—a material which is rarely available, and which without such 
suggestions, as I wish to bring forward, may possibly not be used to the maximum 
advantage when it does occur. This must be my excuse for putting forward some 
rather vague suggestions instead of demonstrated facts; 1 hope that, among many 
other questions of interest, they may be partially answered when the opportunity 
arises. 

With the exception of an occasional eye or part of an eye no post-mortem 
account of a human albino has appeared in print as far as I am aware between 
1783 and 1910. Buzzi in 1783 had not the histological knowledge which would 
have enabled him to throw light on our modern problems. In Adler and M‘Intosh’s 
case the age of the child was too slight for the case to give information as to the 
internal pigment of a fully developed human albino. In the case of the post- 
mortem examination recorded below the answers to several of my problems could 
have been given, but the problems were not in the mind of the medical man who 
made the examination. It is because such examinations must be very infrequent 
that I venture to throw out my queries and suggestions in the hope that they may 
reach physiologists and pathologists; and that although originating in a layman 
they may consider them worthy of investigation. 

In the case of albinism in man there is no doubt of the largely relative 
character of the absence of superficial pigment*. Almost every grade of albinism, 
whether we consider the eyes, hair, or skin, occurs. But there is something more 
than graduated intensity of pigmentation ; there may be local complete absences 
of pigment; besides incomplete albinism, we have partial albinism and this in 
a great variety of stages. Intensity and distribution of the superficial pigment 


* The terms “ superficial pigment ” and “ external albinism” are used to cover the visible pigment, 
or pigment exposed to the action of light. They include the retinal pigment although its morphological 
origin brings it into closer relation with the internal pigment of the brain centres. 
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varies immensely in man. Now besides the superficial pigment, there occurs 
in man internal pigment, notably in certain brain centres, internal ear, liver, 
kidneys, suprarenals, lungs, etc. > It can scarcely be doubted that this internal 
pigmentation plays some part in the case of several of the organs mentioned, 
possibly a most important part in the case of the brain centres, the suprarenal 
glands, and the internal ear. Now is there any reason for supposing that this 
pigmentation varies like the superficial pigmentation in intensity and locality ? 
If so, what are the signs, outward and visible, of incomplete or partial internal 
albinism? Where would it be most reasonable to look for the occurrence of such 
cases? It is noteworthy that pigment superficially injected into an albino does 
not remain on the surface, but is fixed by the suprarenals, liver, spleen and lungs; 
it is eliminated by the kidneys and intestinal epithelium. In Addison’s disease, 
not only bronze areas but leucotic areas occasionally appear. It would seem reason- 
able therefore to suppose that there is not absolute independence between internal 
and external pigmentation ; other cases could be cited, but the suggestion made 
is merely that it is not wholly unreasonable to look first for defects of internal 
pigment in those individuals or stocks where we find defects of external pigment. 


Of course it is logical in the first place to demonstrate that internal pigment 
occurs in albinos at all. Adler and M‘Intosh’s case throws no light from the 
human albino on this point; no more did Buzzi’s. But Mr O’Donoghue* kindly 
investigated some albino rabbits, rats and mice for me and he assures me that 
internal pigment does occur in these as in normal rats and mice. We may assume 
that this is true for human albinos, but this is just one of the points which need 
elucidation as the occasion arises. However, the suggestion made is that it might 
be reasonable to look for defects, partial or incomplete of internal pigment, in the 
same stocks as we find albinism occurring in—either in the case of albinotic, 
or in the case of non-albinotic members. 


I now pass to some special points. The partially albinotic cat, the cat with 
white hair and blue eyes, is invariably deaf. Recent histological examination 
of such cats has shewn that among other important deviations from the normal 
in the internal ear, the walls of the perilymph chamber lack pigmentation. We 
are not justified as yet in saying that this is related to the deafness, but the 
possibility that it is must be borne in mind. I am not aware of any careful 
histological examination of the pigment in the internal ear of congenital deaf- 
mutes. That some forms of congenital deaf-mutism arise from partial internal 
albinism is conceivable and worth inquiring into as opportunity arises. 


* «T have found that the absence of pigment so noticeable in the external appearance of albinos does 
not seem to affect the internal organs. In examining the viscera of albino rabbits I found it impossible 
to discover any difference in colour that would enable them to be distinguished from those of normal 
animals, The spleen, liver, bile, kidney, adrenal body, Peyer’s patches and salivary glands are to all 
appearances quite similarly coloured. Two rats and two mice, one of each pair being an albino and the 
other normal, gave similar results on examination and in addition the mesenteries of both normal and 
albino were alike in containing pigment cells whose presence was demonstrated by silver nitrate.” 
Chas. H. O’Donoghue. 
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The disappearance of superficial pigment is one of the marks of senility, it 
affects the eyes and hair alike. Is it possible that this loss of superficial pigment 
can ever be accompanied by a loss of internal pigment? For example, possibly in 
the case of the brain centres? Senile imbecility and in its milder form mere 
senility might possibly be associated with a weakening of the intensity of pig- 
mentation in certain of the brain centres. The bigger question now arises: Is it 
conceivable that any forms of imbecility, not all, but any, are associated with 
defective brain pigmentation? ‘This opens up an exceedingly interesting inquiry 
for those who are able to examine the brains of imbeciles and the senile insane 
—what, if any, differentiation is there in the centres of brain pigmentation * ? 
Ophthalmoscopic investigation shows a high percentage of incomplete albinism 
of the eye among the insane. 

The Japanese mouse is usually a piebald with completely albinotic eyes, it 
possesses the singular habit of dancing, or better of spinning. This has been 
attributed to various causes. Is it wholly and absolutely certain that the spinning 
characteristic of this mouse may not also be associated with its partial albinism ? 
A special type of human imbecile is the spinning idiot, who rotates like a top. 


Fawn or red pigmentation occurs in close association with many types of 
albinism. Some types of red hair are, we have just found, wholly lacking in 
pigment granules. I have some reason for believing that a larger proportion of 
red hair will be found among congenital deaf-mutes than in the general population ; 
it will in a forthcoming memoir on albinism by Messrs Nettleship, Usher and 
myself be shown to be much in excess in albinotic stocks. Examining albinotic 
stocks there appears, partly in the albinotic and partly in the non-albinotic members 
to be an excessive proportion of imbecility, idiocy and deaf-mutism+. It may well 
be that these associated defects are only stigmata of general degeneracy, but the 
possibility of their having some relation to absence of internal pigment is, I venture 
to think, worth bearing in mind. So much for the suggestions I wish to make ; 
they at least suggest observations which can be made by those so fortunate as to 
be able to examine histologically cases of albinism, imbecility and deaf-mutism. 
They may be quite illusory but even an erroneous suggestion is often a useful 
guide in directing observation to unregarded points, and the authors of the above- 
mentioned memoir on albinism have sought in vain for any report on possible 
differentiation in the internal pigment of albinos—it does not appear to have been 
dealt with. 


In the account which I give below of a third autopsy on an albino, the subject 
was a boy, who combined deaf-mutism and spinning idiocy with albinism. Any 
two of these characters would on the theory of probability be an extraordinarily 
rare chance combination, but the existence of all three seems almost to demonstrate 
some interrelation in pathological origin. This deaf-mute spinning albino was the 
son of a deaf-mute mother. Dr Alfred Miller, the Medical Superintendent of the 


* Dr G. E. L. Keyes, I am glad to say, has this investigation in hand. 
+ Actual statistics will be given in the memoir referred to above, 
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Warwick County Lunatic Asylum, to whom I owe a copy of the Asylum notes on 
this case, has made full inquiries as to the family history of the patient, but without 
success, this being no doubt dué to the fact that the patient was illegitimate. 
I have cordially to thank Sir James Crichton-Browne, for mentioning that he had 
many years ago seen this boy, and suggesting inquiry at the Asylum. 


Particulars as to a Deaf-mute Albino Idiot. 


Admitted, July, 1852. “W. L. aet. 9. A congenital idiot. Is the natural 
son of a deaf and dumb mother. For some time after going to the Birmingham 
Asylum where he has been about 16 months he had frequent bleeding from the 
nose, and used to scream a good deal in the night. He is deaf and dumb; but not 
destitute of intelligence. He is very active, running and jumping about, climbing 
chairs, tables, etc., and has a habit of spinning like a top, though not nearly so 
much as formerly. He is of an affectionate disposition.” There is nothing interest- 
ing related in the notes after this date. He is stated to have had frequent attacks 
of severe epistaxis—and generally to have been in poor health. 


He died on Sept. 18, 1865. 


“ Post-mortem examination 23 hours after death. Body extremely emaciated, 
rigor mortis well marked. Skull thick and dense. Dura mater thickened and 
firmly adherent. Arachnoid slightly thickened. Convolutions of cerebrum ad- 
herent. 


Cerebrum.—Middle lobes very prominent and the corresponding fossae at the 
base of the skull proportionately deep. Grey matter of convolutions firmer than 
usual. 


Punctae sanguineae very numerous. About three drachms of fluid in each 
lateral ventricle. Choroid plexuses granular. Right optic thalamus eroded super- 
ficially along the tract of the taenia semicircularis and its substance soft and 
easily broken down. Anterior part of the floor of the 3rd ventricle of a gelatinous 
appearance and a buff colour. Pineal body soft and friable. Corpora quadri- 
gemina soft and almost diffluent. Cerebellum.—A small patch of superficial 
softening on each side of the anterior part of the under surface. Olivary bodies 
small and ill-defined. Pleurae adherent throughout. Pericardium universally 
adherent externally. Lungs full of tubercular matter. 


I certify that the apparent cause of death of the above-named W. L. was 
Phthisis (chronic).” 
W. H. Parsey, Medical Officer. 


(Copied and forwarded to K. P. by Alfred Miller, Medical Superintendent, 
in letter of Jan. 25, 1909.) 











ON A NEW METHOD OF DETERMINING CORRELATION, 
WHEN ONE VARIABLE IS GIVEN BY ALTERNATIVE 
AND THE OTHER BY MULTIPLE CATEGORIES. 


By KARL PEARSON, F.R.S. 


(1) IN a recent paper* I have dealt with the case when one variable is given 
by alternative categories and the other proceeds by quantitative groupings, for 
example when a population whose ages are recorded is classed into anaemic and 
non-anaemic ; or a population whose cephalic indices have been measured is 
classed into conscientious and unconscientious sections. The object of the present 
paper is to carry the idea involved in such double row correlation tables a stage 
further by supposing the variable classified into multiple categories to be purely 
qualitative. Such variables I have elsewhere spoken of as categoric variables, to 
distinguish them from graduated variables. As illustrations we may take the eye 
colour of an individual and the presence or absence of pulmonary tuberculosis. 
Here the eyes may be grouped into seven or eight classes, but we can only record 
the presence or absence of the disease ; it is true that the immunity of the indi- 
vidual is almost certainly a graded variable, but we are not able at present to mea- 
sure it, and must be contented with an alternative classification. Again, the presence 
of malaria and the skin tint is another illustration from the same field; for while 
skin tint is undoubtedly a graduated variable, no medical inquirer has probably 
the energy or time to do more than group into multiple classes—perhaps seven or 
eight—separated by certain skin tint mosaics. 

Hitherto such double row contingency tables could only be reduced by using the 
fourfold process, originally published by me in the Phil. Trans. Vol. 195 A, pp. 1—4. 
Such a process has two disadvantages; it assumes that a graduated variate which 
follows the Gaussian law is at the bottom of both classifications; and further it 
requires us to make one fourfold grouping, where many are possible. The choice 
left to the operator is not unique, and different selections may modify somewhat 
the result. Even if the mean result of taking several divisions be adopted, we do 
not get rid of an arbitrary element in the process, and the labour may occasionally 
be excessive. 

The fourfold process, absolutely necessary as it often is, should on the whole be 
reserved for those cases in which a fourfold division has arisen from the very 
nature of the data and not be applied to double row tables, where by using it we 


* Biometrika, Vol. vit. pp. 96—105. 
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lose all the advantage of having one variable in multiple categories. It is further 
to be noted that in many cases it is impossible to suppose any adequately graduated 
variable behind the multiple categoric variate. For example, if we wish to find 
the relationship between crime and occupation, we may easily select 10 or 12 
groups of kindred occupations, and discuss the number of convictions in each such 
group of individuals, but it would be unreasonable to suppose any continuous 
variable behind this grouping, while we might reasonably suppose a continuous 
variable behind the tendency to criminality. The present method has the 
advantage of giving a unique solution for double row contingency tables; it makes 
less appeal to hypothesis than the fourfold division method applied to such cases, 
and it is far more rapid in execution. 

Finally, the necessity for some such method has been forced upon my notice by 
the great frequency with which such double row contingency tables have recently 
occurred in the work of those dealing with medical, sociological and criminological 
statistics in my laboratory. 

(2) The theory of the method is very simple. Let y be the categoric variate 
in multiple classes, and « the alternative variate. We suppose « to be ultimately 
continuous, but by using y we do not suggest continuity. For such a system the 
correlation ratio, 7, has a perfectly definite meaning, it is the ratio of the standard 
deviation of the weighted means of the y-arrays of z’s to the standard deviation 
of the whole population ; in symbols: 
mae: S {Ny (Ly — x) 
Piles é No? 
where n, is the number of individuals in any y-category ; x, is the mean & for this 
category and % and oc, are the mean and standard deviation of the a’s of the whole 
population VN. This value of 7, it is well-known*, must lie between 0 and 1; it 
becomes equal to r, the correlation coefficient, when the regression is linear. 
Further the mean standard deviation of the weighted arrays is known to be 
o,V1—y, a value which becomes small as » approaches unity, or « becomes 
absolutely defined by the y-category in which it lies. 

Now let ,o, be the standard deviation for the w’s which fall into a special 
category y, then we may take: 


1 ‘ a oe ZS fa 
n= S(n, 4.” =) ear Pikes «sce Danie Sica tial Roe (11), 
N \ — xylan Cx \Fx 
since S (nyty) = NZ. 


Now we have just seen that the mean value of ,o,f =o, Vvi- n*, we shall 
therefore assume that the distribution is sufficiently homoscedastic for us to 
replace ,o,7/o,° by its mean value (1—7’). Hence: 


» fe, ae’ z\? 
n = (1 — 9°) S( a — )-( 2 


\V yx , Cx 


* Roy. Soc. Proc. Vol. 71, p. 303. ‘On the General Theory of Skew Correlation and Non-linear 
Regression.” Drapers’ Research Memoirs, 1905, Dulau & Co. 
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and from this it follows that 


ca Sallia ) | Ralacaiticedieunttal (iii). 
+5 \" (=)t 


This is the value of the correlation ratio, based, so far, solely on the assumption 
that the arrays are sufficiently homoscedastic to permit of our replacing the ratio 
y7x/oz by its mean value. There has been no hypothesis of linear regression, still 
less of a Gaussian distribution. 








It now remains to consider on what hypothesis we shall determine «,/,o, and 
%/o,z, which are not directly determinable from the data. We shall assume that 
the mean of each array of «’s as well as the mean of the whole population may 
be found in terms of their respective standard deviations by tables of the 
probability integral. In other words we assume the distribution of the x-variable 
to be approximately Gaussian. Thus we find uniquely the correlation ratio of a 
double row contingency table by the following assumptions : 


(i) The alternative variate is sufficiently Gaussian to admit of the means 
being found by tables of the probability integral. 


(ii) The arrays are approximately homoscedastic. This latter condition is 
unnecessary, if the alternative variable be replaced by one grouped in three classes, 
for then the actual value of yo,/o, can be found for each array and we need not 
give it the mean value ¥1—y*. With regard to the multiple categoric variable 
no assumption as to the nature of its distribution has been, or need be, made. 


Illustrations. 


(3) I will take first some illustrations from a forthcoming paper dealing with 
the relation of pigmentation to psychical characters. 


(A) Conscientiousness in Boys and Hair Colour. The conscientiousness was 
classified as “keen” and “dull” with an intermediate class, which was found to be 
relatively very small and distributed accordingly between the two main groups. 
We have the following table: 


Hair Colour. 














% ted | Fair | Brown Dark | wit, Totals 
wD | 

Ss | | 

‘3 | Keen es 59°5 4415 | 433 354°5 | 20°5 1309 
& | Dull am 24°5 1945 | 188 162°5 | 16°5 586 
°S 

oa | 

8 Totals 84 | 636 621 517 | 37 1895 
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Karu PEARSON 


We have, using Sheppard’s Tables : 
loz = — 4980, 
and for the arrays in order from red to jet black: 
#,/o,, = — 5485, #,/o,,=— ‘5077, #,/o2, = — 5166, 
&,/o,,= —°4837, %,/o,, = —°1359. 
There is thus an almost continuous decrease in conscientiousness as we pass from 


red to jet black hair*. Or, there is a very slight but sensible relationship of hair 
pigmentation and conscientiousness. Using formula (iii) we find 


1 ( Ly \") cee 
S \ Thy 5 *) = “Qi ”. 
yn‘ ("Y Bs f 1 ) 
2515 — -248 
Hence n= - st » = ‘00276, 
and = 053. 


The assumption that conscientiousness is a continuous variate approximately 
following the normal distribution appears a reasonable one. 


(B) Self-consciousness and Hair Colour. The data for 1914 schoolboys are 
given in the following table : 


Hair Colour. 























; we ea | ; 
| Red | Fair Brown | Dark | fi ms Totals 

, | 

a 

Selfconscious ... | 42°5 | 344:25 | 296°5 | 238 4:25 | 935°5 | 

Unselfconscious ... } 36°5 | 322°75 | 351°5 | 2515 } 6-25 | 978°5 | 

| | 

| 

Totals 79 667 | 648 | 489°5+ | 305+ | 1914 | 

| | | 











Here Z/o, = + ‘02816, and for the arrays: 
Z,/o,=— 0958, #,/o,=— 0404, %,/o, = + 1066, 
G/o,= + 0346, s/o = + ‘0822 . 


: 1 y\? 
— y y i 52 
These give v* \n ( =r} 0052, 
whence we find » = 066. 


It will be seen that hair colour is more closely related to self-consciousness 
than to conscientiousness, the red and fair boys being more self-conscious than the 
dark haired boys—an experience which may I think be confirmed on adults. The 

* The values of x reduced to a common unit ¢,, are : %/¢,= —*5477, %o/o,= —*5070, Fs3/o,= —°5159, 
%,/o,= —*4880, Z;/0,= — *1357. 
+ One boy was placed intermediate in hair colour between dark brown and jet black. 
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means of the arrays are, however, not so smooth* as in the former case and there 
is some sign of the regression curve being heteroclinal. 


(C) Handwriting and Eye Colour. I have chosen this illustration because it 
admits of using formula (ii) directly. The material is represented in the follow- : 








9 
ing table: i 
Handwriting. 

alt "en | | | 
Pa Good | Moderate | Poor | Bad Totals | 
400 | | | 
S | | 
| Light ... | 45 | 249 | 312-75 | 1125 | 298-75] 748 | 
D | Medium ... 60 252 =| 302°75 | 99 27°25 aa. | 
| Dark as 41°5 | 137° | 182 | 2 <5 432 | 
ica) | | | 
Totals 146°5 638°5 | 797°5 | 274°5 64 1921 | 
! 











Let us call the range of medium eye colour h and measure the means first 
from the boundary of light and medium (#) and then from the boundary of 
medium and dark (a’). We havet+ 

%/o,='2809, @/o,= “7558, h/o,=1-0368, 

&,/o,=°'5039, %,'/o, 6731, h/o,=10770, o,/o,= ‘9626, 
#,/o,='2794, @,/o, ‘7880, h/o,=10674, o./o,= ‘9713, 
&/0,=°2737, @/o,= ‘7448, h/o,=1:0184, o,/o,= 1:0180, 
&Jo,=°2280, &//o,= ‘7405, h/o,= ‘9684, o,/o,=1°0708, 
G,/o5='1276, %'/o,= 11504, h/o,=1'2780, o5/o,= ‘8112. 


Il 


We see that the last column gives us the means of finding the ,o,/o, of the 
formula (ii). Also we have alternative methods of determining » according as 
to whether we use the first or second column, Le. # or a’. 

Using the first column we find : 

n? = 0835 — -0789 = 0046, 
or: m = ‘068. 

Using the second column we find : 

n/? = "5754 — 5713 = 0041, 
or: n = ‘064, 
The close accordance of these results speaks well for the application of the 


method to the present material. We may test it again, assuming that medium 
and dark eyes are classed together, and using formula (ili), we find in this case 


nn’ = ‘070, 


%4/o,= +°0345, Z;/o,= +°0821. 


* Reduced to a common unit ¢o,, they are %,/¢,=--0951, Z/o,= -°0403, %3/o¢,= +°1064, 
+ Reduced to o, we have Z,/¢,=°4851, %2/o,=°2714, %3/o¢,=°2780, X4/¢,=°2441, %;/o,=-1035. 
£ x / £ x > i x 


x 
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agreeing for all practical purposes with the value found for » above, and indicating 
that although the arrays as shown in the fourth column are heteroscedastic, it is 
practically sufficient to give yo,/o, its mean value. Examining Column 1, we see 
that the mean eye colour of the arrays shifts continuously towards the light as we 
pass from good to bad handwriters. The association is small, but can hardly be 
without significance. 

(4) Alcoholism is a character which may be reasonably treated as one of 
graduated intensity, and one accordingly to which our method may be reasonably 
applied. I take as an illustration some statistics of crime most kindly provided by 
Dr Charles Goring. 

(D) Alcoholism and Type of Crime. In the following table the crime 
categories have been arranged in the order of relative intelligence as ascertained 
from Dr Goring’s correlation investigations on crime and intelligence. 

Class of Crime. 











Arson | Rape | Violence Stealing Conn Fraud | Totals | 
| Alcoholic ik 50 88 155 379 | 18 | 63 753 | 
| Non-alecoholic ... 43 62 | 110 300 14 | 144 673 
| } | 
| Totals 93 | 150 | 265 679 | 32 207 1426 | 
i b ecacual | ee 








We find: #/o, = + ‘0704, and for the arrays*: 
@/o,=+°0946, #,/o,=+°2190, %/o,= +°2145, 
P4/o,= +°1463, %/o,;=+°1573, %/o,=—'5119. 
Using formula (iii), we determine, 7 ='234. There is thus quite a sensible 
correlation between the extent of alcoholism in the criminal and the type of crime 
he will commit +. Starting with fraud, there is increased alcoholism as we pass to 
stealing, coining, violence and rape. An exception, however, occurs in the case of 
arson, the class of crime whose perpetrators are least intelligent among criminals. 
Those who commit arson appear to be less alcoholic than coiners and thieves— 
criminals who require considerable sobriety for success in their callings. Probably 
the coiners would stand above thieves in the scale, did not the returns include the 
offenders who pass false coin as well as the producers ; further, highly skilled but 
alcoholic workmen, who cannot retain on that account their berths, may in the 
case of certain trades tend to gravitate into this class of criminal. While rape 
and crimes of violence arise from a low grade of intelligence combined with 
alcoholism, it would seem that arson has a low grade of intelligence for its source, 


but probably some character such as revenge rather than alcoholism ultimately at 
its source. 


* Reduced to a common unit: Z,/o,= +°0920, %/o,=+°2129, %3/o,= +°2085, %,/o,= +-1422, 
%;/¢,= +1529, %/o,= —°4977. 

+ The reader must remember that this has no bearing on the problem of whether alcoholism leads 
to crime. 
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(E) Alcoholism in Parents and Intelligence in the Offspring. I owe the data 
for this illustration to Miss Elderton, who is treating alcoholism in parents and its 
effect on children at length in a forthcoming memoir. Some of her data are given 
in the accompanying table : 


Intelligence of Sons (Edinburgh Data). 




















— i. fos 

| Excellent | Good Medium Dull | Defective | Totals | 

3 | | | 

=| Alcoholic +2 19 | 125 116 46 7 313 | 

£ | Non-alcoholic ... 23 97 99 45 9 273 | 
er} | 

a | 

Totals 42 222 | 215 | 91 16 586 | 

. Se et nts | 











We have: #/o, =—°0857, and for the arrays* : 
#/o,=+°1197, &%/o,=—'1587, %,/o,=— 0993, 
Z,/o,=—'0138, %/o,=+°1573, 
this last value being of very small weight. The results are most remarkable; if we 
omit the excellent intelligences, we obtain an almost uniformly descending intensity 
of alcoholism in the parent, as the intelligence of the child diminishes; the least 
alcoholic parents have the less intelligent offspring. If we treat the table by a 
fourfold division, between “good” and “medium” intelligences we find for the corre- 
lation of parental sobriety and intelligence in offspring the value r=—-03. It is 
clear that in the “excellent” group some special cause is at work, which gives the 
regression its non-linear character, and leads to a correlation ratio » of ‘086. The 
same want of regular increase of intelligence of offspring with sobriety of parent is 
found in the case of parents and daughters, where the correlation coefficient is zero 
and the correlation ratio 7 =‘102, the “excellent” as well as the “dull” and “defec- 
tive” groups of daughters having in this case the more alcoholic parents. It is 
clear that the relation of alcohol in parents to intelligence in children is not of 
a close kind and the association, such as it is, is not of a continuous character and 
apparently different in the two sexes. 


(F) Health of Daughters and Alcoholism of Fathers. Miss Elderton has 
provided me with the following table: 


Health of Daughters (Manchester Data), 




















| | | P] +s ee Y “ie 
i | moe . . | Phthisical and Died f ; 
| Healthy | Fair | Delicate | Epileptic | Young Totals 
Fa | | 
Z Alcoholic aes 104 re 24 8 38 184 
3 Non-alcoholic ... _ | w | 41 83 453 
<a 
| 
‘Totals 358 26 83 49 121 637 
= = =, < ee seat 2 
* Reduced to a common standard: Z,/c,= +°1193, %/o¢,= ~°1581, Z3/c,,= — 0989, Z/o,= — 0989, 


Z5/0,= — 0137, Z/o,= +°1567. 
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Boe = + 5567. 
Here* @/o,=+°5519, &,/o,=+°2934, %,/o,=+ 5559, 
%/o,=+ 9811, ~Z,/o,=+ °4844. 

The correlation ratio for this table is 7 =‘133, but a little examination shows 
that there is no definite association of poor health in child and alcoholism in 
father. Of the healthy children and of the delicate children the fathers have equal 
intensity of alcoholism; the phthisical and epileptic children come in an over- 
whelming proportion from the sober parents. The parents of those who die young 
are somewhat more alcoholic than those who survive. The small difference may 


well be due to more accidents and not to worse health in the children of alcoholic 
parents. 


(F) bis. Health of Sons and Alcoholism of Fathers. Miss Elderton’s table is 
as follows: 


Health of Sons (Manchester Data). 











i oF | | oe . 
; a mi Phthisical and Died we 
Healthy | Fair Delicate Epileptic | Young Totals 
Ez | | | 
2 | Alcoholic ae role 2 | 8 42 205 
= Non-alcoholic... 328 =| 37 | 71 | 37 | 133 606 
é | | 
| 
Totals . 450 | 46 | 95 | 45 | 175 811 








Here+ %/o,= + 6658, and for the arrays: 

@,/o,=+°6095, %,/o,=+°8573, %,/o;= "6662, 
#,/o,=+°9239, %/o;=+°7063. 

The correlation ratio is here n="089, but the correlation coefficient obtained from 
a fourfold table of healthy against the remainder is — ‘06, or the children of alco- 
holic fathers are the more healthy. It is quite easy to see from the above results 
how little evidence there is for alcohol in fathers affecting the health of sons. As 
in the case of girls, but more sensibly so, the healthy children have fathers some- 
what more alcoholic than the average; the delicate children have fathers not less 
sober than the average; the phthisical and epileptic children have fathers markedly 
more sober than the average population, and the early deaths are slightly more 
frequent among the children of sober parents. It is difficult to see in either of 
these tables evidence for alcohol as a chief source of bad health or poor intelligence 
in the offspring. 

(5) As a last illustration I take certain statistics of the severity of attack in 
cases of smallpox and the strength of immunity provided by vaccination. I owe 
the following data to Dr John Brownlee of the City of Glasgow Hospital. 

* Reduced to a common standard: %,/c,,= +°5470, Zo/o,= +°2908, Zs/o,=—+°5510, X/¢,=— +°9725, 
%5/o,= + 4801. 

+ Reduced to a common unit: %,/¢,=+°6071, Zo/o,= +°8539, Zs/o,= +°6635, X/o,= +°9202, 
%/o,,= +°7035. 

33—2 
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(G) Severity of Attack and Strength of Immunity due to Vaccination. I look 
first at the data arranged as a two row correlation table. 


Severity of Attack. 

















d | wey e 

= Haemorrhagic | Confluent | Abundant | Sparse a ery | Totals 
3 } parse 

& ' 

5 we ~ . | 

cs) Since vaccination ® ‘ ‘ = j 

3 ay spe 5 38 100 | 176 | 148 487 
g | Over 25 years or | 44 251 388 308 211 1202 
= unvaccinated  { 

7) 

F | Totals 49 289 508 484 | 359 1689 
by 








Boe = + °5583. 
Here* ®,/o,=+1:2700, %,/o,=+1:1194, %,/o,=+°7185, 
@J/o,=+ ‘3488, %,/o,=+ ‘2218. 

From these results we find »=°321, a very sensible relationship, and we see 
how the mean vaccination immunity increases as we pass from haemorrhagic to 
very sparse cases. There is very little doubt that we are dealing here with a 
perfectly graduated and continuous cause and effect, remarkably different in its 
character from the alcoholic influences we have just discussed. 


To illustrate this point, I proceed to investigate the same data arranged as a 
three rowed table, using formula (ii). We have: 


Severity of Attack. 














5 Haemorrhagic | Confluent | Abundant | Sparse Ptio Ae Totals | 
= Sparse 
'S 0-25 years... 5 38 120 | 176 | 148 487 
& | 
> 25-45 years... 29 | 155 299 268 | 181 932 
ey 5 | 
= | Over 45 or un- | 15 9% | 89 40 | 30 270 
a vaccinated  § 
dd 
oe Totals 49 289 | 508 | 484 359 1689 
| 
We havet: @/o,= ‘9951, @’/o,= °5583, h/o,=15533; 
Z,/o,= °5069, &//o,=1:2700, h/o,=1°7769, o,/o,= ‘8742; 
%/o,= “4329, %//o,.=11194, h/o, =15523, o,/o,=1:0007 ; 


Z,/o,= ‘9338, 2'/o,—= ‘7185, h/o,=16524, 9,/o,= “9401; 
%/o,=1°3875, @//o,= 3488, h/o,=1°7363, o,/o,= °8946; 


%,/o;= 13815, %;/o,= ‘2218, h/o,=1°6032, o,/o,= ‘9689; 


* Reduced to a common unit: %,/c,= +1°2028, Z/o,= +1:0602, Z3/o,= + °6805, X/o,= +3303, 
%/¢,= +°2101. 


+ Reduced to a common unit : %,/0,,=*4431, Z%»/o,,='4332, Z3/0,=*8779, X4/o,,=1°2413, Z/o,=1°3385. 
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leading, according to our horizontal division, to »=°319 and »/=‘337 with a 
mean value ‘328, in excellent agreement with the two row table value ‘321. 


Lastly the whole data has been taken as a contingency table: 


Severity of Attack. 

















| ee | | “ame | 
5 | Haemorrhagic | Confluent Abundant | Sparse | - ery | Totals 
a parse 
3 
8 | | 
E 0—10 wd 1 6 ll | 12 30 | 
Se 10—25 5 37 114 165 | 136 457 | 
25—45 29 155 299 268 |} 181 932 | 

s Over 45 11 35 48 : ed) oe 155 
= |  Unvaccinated 4 61 41 7 2 115 | 
L 

= | | | | 
be Totals 49 289 | 508 | 484 |} 359 1689 | 





Worked out as a 25-fold contingency table by mean syuare contingency we have 
C, = *335, 


which lies between the two values of » found from the three rowed table by the 
new method, and is within the probable error of either. This agreement is very 
satisfactory evidence that it was legitimate to apply the new method to a case of 
this kind, and that in this particular instance the material was closely normal. 


(6) The illustrations given will, I think, show that a wide range of problems 
can be dealt with by the new method, and that results found by it are closely 
comparable with results obtained by other processes. It does not assume linearity 
of regression, and in fact as in the examples on alcoholism it brings out in a very 
effective way the deviations from linearity. The hypothesis used is: that in the 
ease of the variable with alternate categories, we can suppose it continuous and 
sufficiently normal in character to have the mean values found from tables of the 
probability integral. If the arrays are not sufficiently homoscedastic for us to 
replace their standard deviations by their mean value, then we must use if we can 
a three and not a two rowed table. It is always desirable to have if possible such 
a table, because it enables us to test the nature of the variability in the arrays. 
Our last illustration, however, shows that very considerable variations in the 
standard deviations of the arrays do not sensibly modify the result obtained by 
using for those standard deviations their mean value. On the whole this new 
method, which replaces the fourfold table method for cases in which the latter 
does not give a unique answer, is, I think, likely to prove of some service. 











ON THE EFFECT OF A DIFFERENTIAL FERTILITY 
ON DEGENERACY. 


By KARL PEARSON, F.R.S. 
A NEW YEAR’S GREETING TO FRANCIS GALTON, 1910. 


(1) I HAVE indicated in several earlier papers* the very marked changes 
which are produced when there is correlation between fertility and any character 
in a species. I have termed the changes which result in the mean and variability 
of the character genetic or reproductive selection, and have shown that if such 
correlation exists reproductive selection may wholly defeat or largely neutralise 
the results of natural selection. In actual wild life, it is very difficult to find any 
character sensibly correlated with fertility, and it would appear that the low value 
of correlations of this character is an essential condition for rapid progress by 
natural selection. Nature has in some way—at present not clear to us—prevented 
this reversal of the survival of the fitter by suppressing all marked correlation 
between the physical characters and the fertility of a species. Half a century ago, 
I think, this absence of correlation between fertility and physique would have 
been found to hold for man in this country. It is practically certain that it does 
not hold to-day. Artificially a differential fertility has been created; the better 
mental as well as physical characters can be shown to be associated with a lessened 
fertility and a reproductive selection has been called into play, which not only 
impedes, but possibly reverses natural selection. The object of the present paper 
is to obtain—at any rate to a first approximation—some measure of the secular 
changes in a race which must flow from a correlation of fertility with any character 
of an organism. The problem in the case of bisexual reproduction is not a wholly 
straightforward one. It is influenced in the first place by the intensity of assorta- 
tive mating in the species. In the next place we have to ask: Does the fertility 
depend on the intensity of the character in one or in both parents? And lastly 
we must find a reasonable form for the relation between fertility and the intensity 
of the character. These points will be considered in the following sections. 


* See Phil. Trans. Vol. 187, p. 258; Vol. 192, pp. 259, 314; R. S. Proc. Vol. 59, p. 303. The 
Chances of Death, Vol. 1. p. 63 et seq. 
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(2) Wewill suppose that some character—e.g. grade of intelligence—influences 
the fertility. We have then to consider whether this character in the male and 
female influences equally or unequally the fertility of the pair. It has been 
asserted by some that academic training in the woman lessens her power of child- 
bearing. In this case the more intelligent women would have fewer offspring, 
but owing to assortative mating such women marry the more intellectual men, and 
whether intelligence was or was not associated with a lesser grade of fertility in 
the male, assortative mating would handicap the fertility of the intelligent male, 
and the more intelligent males would be practically less fertile. 


Let &, and & be the deviations from their respective sex means of male and 
female for any character, o, and o, the standard deviations with respect to this 
character in the two sexes, represented by the subscripts 1 and 2. Let r,. be the 
correlation of £, and &,, i.e. the intensity of assortative mating. 


Then it is reasonable to assume that the fertility of the pair is some function 

of w, where « is a linear function of &, and &,, or: 
&.. & 
coz 


L=—+Cy 
a 


The mean of « is clearly zero, and we can free ourselves from the influence of 
either parent by putting c, or c, zero. If fertility were related to x by a simple 
linear correlation then we should obtain, it is well known, the highest correlation 
of fertility (y) and x by taking (the subscript 3 denoting fertility) : 


Tis — TieN 2s Y23 — Tie is +: 
| C= ———, eccvccvccccccccccccccces ll 
: 1-1? ’ » 1-1? (u), 
or proportional to these quantities. 


We need not at present however assume any 
special values for c, and ¢,. 


Further, it is desirable to suppose that the curve of 
mean fertility for each value of # is not necessarily linear, but of a more general 


type, allowing us to make fertility a maximum at other grades than the extreme 
values of the character. 


(3) This leads us to the next point. What law of fertility seems reasonable ? 
We want a law of fertility which will allow closely of the fertility (a) increasing 
nearly uniformly and at any given rate with the character, (6) decreasing nearly 
uniformly and at any given rate with the character, and (c) being concentrated 
with any degree of intensity round any grade of the character, and falling away 
on both sides of this maximum. All these conditions are fulfilled if we assume 


the mean fertility y for any grade of the character 2, not to be given by a line 
but by a normal curve, e.g. 


ea ET TT wcsiresittee ecarsuabiell (iii), 


Here by a proper choice of o, and k, we can make the distribution of fertility 
fall or rise with the character (i.e. k positive or negative and both k and a, very 
large,—k so taken that the centre of fertility lies outside the region of the range 
of values of x), or concentrate the fertility on any grade of the character (i.e. make 
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k equal that value, and take o, small). It appears therefore that the distribution 
indicated, if not ideal, will still serve as a first approximation to the type of cases 
likely to occur. 


There is, however, an argument of still greater weight for the suggestion that 
a fertility curve of this kind cannot be widely removed from the actual state of the 
case; namely, when we measure almost any human character (i) in parents and 
(ii) in their offspring we find both distributions very approximately normal. This 
could not possibly be the case unless the fertility curve were itself approximately 
normal. We include of course under this the case in which o, is indefinitely great, 
or the fertility is not associated with the character in question, which is probably 
what we may describe as the organic condition of affairs. Of course in actual 
working we have to regard many special features. Tor example, in the matter of 
intelligence, it is fairly obvious that the abler men and women have fewer children ; 
but is it true that the fertility falls again towards the extremely dull? This is a 
point which needs careful consideration ; I think it probable, especially when we deal 
with net and not with gross fertility. Extreme mental defect leads to neglect of 
offspring, to life in institutions and limited marriage. For example, pauper 
fertility is higher than criminal fertility. I shall illustrate later this point. 


(4) Let us now turn to the direct problem. A parentage has the deviation « 
of the character and o, is the variability of this deviation; the deviation in the 
offspring is z with a variability o, and a correlation p. Then, if there were no 
weighting with a differential fertility, the frequency surface would be 


a. 2 x2 2prz 2 
~31--\c2~ ow, ‘a? 
U=WEe * f 2 x72 2 


in the usual way. But we must multiply this by the average fertility of each 
grade to obtain the frequency distribution. This will be 


3 1 (x — k)? 3. a (= 2pxz . 2 ) 
vane 2 © xe BLP NCP Oy ST) «Loic ccccccccccccccccccs (v) 
a ee 
mue 308 xg BIB eet at) ~~ ot eure oN «(vy bis 
t 24 (a -my)* _ 2 (#—my) (z—my) RK. (z-my)*} 
=u xe 21-R( 2 2124 OF Whceatiecesase (vi), 


where »,' is a fresh constant, i.e. independent of # and z, and m, and m, are the 
means of effective parentages (i.e. weighted parentages) and of actual sons, =, and 
=, their standard deviations and R their correlation. We have by finding the 
centre of the elliptic contour of (v) bis in the usual way: 


Mg 2 TO GPF cavicnccsnicssenvnsusesisesiees (vii), 


My = M4pa;/ Fz 


The first of these equations shows the shifting of the parental mean, the change 
being due to differential fertility; the second equation shows that the shift of the 
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offspring mean exactly follows the shift of the parental mean, or the mean of the 
offspring is the offspring of the mean parent. This gives at once the secular 
change due to weighting the parentages with their fertility. 


Further we must have 
1 1 1 1— 
7 Da\sa 2 ( “Ba P ) ’ 
(- kf) te 1- pp? \o, oe 


a 1 (= 
Q—k)>2° 1p? Ai 
eee ie 
(1 — R)>,>, 1 p?oz0,° 


From these flow readily 


Ra <n il eiarvensnsonweewnivershiy cunts (ix), 
J hed 
on 











These results show us at once that differential fertility: (i) reduces the variability 
(>,) of the offspring, (ii) reduces the variability of the effective parentages, (iii) re- 
duces the apparent correlation between parentage and offspring. It should be 
noted that the non-differential fertility is obtained by putting o,=0; thus 
any differential fertility up to ¢,=0, which denotes breeding from a single grade, 
will give lowered values for R, =, and >). 


While the variabilities and correlation are thus changed, it is worth noting 
that the regression of offspring on parents, ie. R>./=,=pe,/c, and is therefore 
unchanged. Further the equation to the regression line is 


2—mMm,=p 5 (a —m,), 
x 


or since m,= m,pa;/o,, it follows that 


or is identical with the regression line for non-differeutial fertility. This is, of 
course, only a case of the general principle found long ago* that selection of a 
character A does not alter the regression of a second character B on A, but it is 
interesting to note that the effect of differential fertility is as it were a pushing of 
the population down its regression line. 


* Phil. Trans. Vol. 197, pp. 20, 21. 
Biometrika vir 34 
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(5) We may next inquire into the mean fertility and variability of fertility 
in the population. 

The fertility of the parentage « is 


ee ee cilia 25, ROR PD LOTTE. (iii) 


% a eaanelcantey ge, econ 


and the frequency of such parentages is given by 
‘eee 
V 2a oe 


where N is the total number of such parentages. 


w 


Hence, if f = the mean fertility : 





1 ja?  («-k/ 
ae +x pail —— 
vf=| oe e 3 las oo" | ae, 
-ao 27 a. 
and accordingly : 
1 2 
f I ORR cee (xii) 
\ oet+ Oy 


This is one equation linking up the indeterminate quantities y,, 0), k with 
observable quantities like f the mean size of family in the population. We can 
now find the standard deviation >, of the size of families. Clearly: 


>= Sum (y*)/N — f?, 


a +a 
N (22 +f?) = [ wyda 





1 k2 
=- Nythoo_ ée ~ 2hee+02 
V ta; +o,? 
as before. Hence: 
ke ke 
>, =ye 7 %% ev t 20, . A , mE ot+o,2 
VOq + 206, ay + o,° 
2 Ko? 
Ff =| =!) pee ee (xiii). 
\f a, Vo2+ 20, 


V; is an observable quantity and this equation may be used to find k if o, has 
been otherwise determined. The previous equation will then give y. We 
should have to go to the third moment coefficient to determine y,, o, and k 
from the fertility distribution, and it may be doubted whether the result would 
be of much value; it seems better to endeavour to determine o, and k from 
observing the average fertilities of those exceeding certain grades of the character. 


(6) Determination of the constants of the fertility distribution from the 
average fertilities of those possessing more than a certain grade of the character. 


| 
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Let », be the number of the population, N, of pairs of parents who exceed the 


value h of the character 2; then: 





ny, = es 
V°rox 


n/N = - 


a 2 
—172/¢g 2 
| en Io, dz, 
h 


1 [" ae 
—_—_— ée 2° 
NV Qr hic, 


This equation determines h/c, from Sheppard’s Tables, since n,/N is known. 


Further if f, be the mean fertility of those over grade h: 


= eine 


— 42? 
Sh = pees l 


96o.2+62 
Yoe 409° + Cy 


V20 ox I, 
eae [ 
N20! wr 
fa: eS 
cod \ Cy + Ox 


H’ (h— 


F where 
TxFy 


2) 


ox ydx, 


Lox e k \ 





Oo + 0," 


Now mn ful Nf is clearly known, therefore H’ can be found from the tables. 


We will suppose two grades of the character to be h, and fA, and the corre- 


sponding average fertilities f,, and f;,, then H,’ and H,’ 


there results 





, Tx at i ko, 
H, Ps a me ies o, 
ees a a Rn. oe 
Voe aL Cx oy + Ox 
HY -H, Voe?+o2 
h,—h. Tx 
oC. 
Tl S “4 0 — a - 
ius o 7 /H, — H.\ me 
Vj % ( h, — hy / 
: Hy’ (hy — hz) a a Big: 
Further: H/-H; ~ h=—k nent 
wus 








oa’ = HY 


will be known; and 


, ox (H, — HP —(h,- hy 
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Hyh,— Hyh, 1 : 
Or, finally : k= H’~H, oe ee (xvi). 


x o,°(H,'—H,’P 


Thus o, and & are determined, and f being known, we can find y. In fact 





Bon’ = § {(Hy’he - Hy'hy)/o,}? 

- 1 — +442 (Hy -- Ho’)? — 4 (ly — hg) /o,}? *s 
= ‘ er oe fe Te tpeenee ears XV11). 

y=f (hy — he) / ox =) 


Thus the coastants of the fertility distribution are fully determined. 





(7) Determination of the constants of the fertility distribution, from a know- 
ledge of the fertility of certain grades of the population. For example, when 
£L=—M oz, let the fertility be y,, and when e=m,¢,, let the fertility be y. 


Then 
<3 (mezt*\ 
Yi = We 2 oH . 


3 : (m2 - =) 


Yo=Yo@ ~“\ 4 


Let oz/o,=r, k/o, =p; we have: 


/ ey eH) 





loge =m 2+ p, 
[T= WED op 
log e 
_1__{_ /2 (log y= logy) , / 2 (log x — log ys) _ , 
Mm, + Ms log e \ lug e j ay 
y ose X VEE 


ss > ea (log yy — log Ya) _ me p? (log y% — log Yo)) = E 
Mm, + Mz log e \ log e j 
The signs of the roots are arbitrary and must be selected so as to accord with the 


needs of the problem. For example, if m,=m,=m, either Am or w may be 
equal to 


iv 2 (log yo—log yn) 5/2 (log yo — log ys) 
= c eee Bz i 
log e om log e 


and which we choose depends on whether the maximum fertility lies between the 
two grades or outside and beyond the grade of larger fertility. 


The above equations assume that y, is known, it is the maximum average 
fertility of any grade in the community. A third equation to determine y, from 
the average fertility of the whole community is provided on p. 262, Equ. (xii). 
But the equations then become troublesome and can only be solved by approxi- 
mation. It will be found also that small changes in the fertility curve are not 


very influential in modifying the main results to be drawn from the equations 
set forth in this paper. 
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(8) Numerical Illustrations. I propose in this section to illustrate the main 
results of the above investigation by selecting possible values for some of the 
constants. I am far from assertmg that these are actually existing values, but 
they appear not improbable values—say, for the case of ability. We are only at 
present acquiring knowledge of the relationship between ability and fertility in 
the case of man, and the examples I give are merely illustrative and intended to 
indicate how the algebraical formulae can be dealt with. At the same time, I 
think, they throw some light on rather urgent national problems. It is desirable 
to bring home to the minds of the thinking classes what it really does mean, if the 
fittest in any character have not a third of the fertility of the least fit. 


Illustration I. The upper decile of a population has an average fertility rate 
of 2, the 50°/, showing the lower values of the character an average fertility rate 
of 6 The fertility of the whole population has a mean value of 5. Find the 
changes in the population during one generation. 

Probability integral tables give at once h, = 1'2816c;. 

To determine H,’, we have 

Mn, X fi,|N x f= ‘1 x 2/5 = 04, 
therefore Hy = 1°7500. 


Clearly h,’ =0, ma, fi n/N x f=5*x 6/5 =*6, from which the tables give us 
H’ = :2533. 


A rough diagram indicates at once the sign of the quantities and shows 
that we must have 


HY =h, + o2k/(o2+0,") 


ics Hy = o2k|(o2 + 02) ~ he 
or in this case 


1:'75000, o/Voe+ o, = 1:°28160,+ oz k/(o72+ 022), 
'25330,0,/Va + 27 = oxk|(o2 +o); 
or, subtracting : os 
1:4967o,0,/Vo,2 + o22= 128160, 
whence o,= 1'6579e,, 
and on substituting 


k=°‘8181oe;,. 
The equation 
1 ke 
f= ee @ 2o%+0,2 
Vo. + oes 
will now give us y. On substituting the values of o, and k and 5 for f we find 
Yo = 6°439. 


The distribution of fertility is accordingly given by 


ape 
y= 6°439e 2\ 165790, 
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It is clear therefore that the fertility rises to a maximum at ‘813lc, and then 
falls again, so that the lowest values of the character have a lowered fertility, but 
are still in excess of the corresponding high values. Thus m,, the mean of the 
effective parentages, is given by 

m, =ko?/(o.? + 627) ='216902, 
measured below the old mean, and 
>, = standard deviation of effective parentages 
= 0)0,/Vo,) + a2 = '85630z, 
or, the effective parentages have 16 per cent. less variability than individual 
parentages, i.e. parentages unweighted with their offspring. We can now find 
the average fertility of the lowest decile of the population: 








a wx 1 /2"\2 
Nf Qe qd pa 
= a adie da’, 
QarJ « 
+ 3 _hi-m 
where q=q/2.=- 
“1 
1:2816 — ‘2169 
ee > — = ]°2 oT. 
8563 + oaes 
Hence Nn, JM, _ 1 Sr, = 10686, 
Nf 10 5 
or, Fn, = 5°34; 


that is to say, that while the average value of the uppermost decile’s fertility is 
only 2, that of the lowermost decile is 534, ie. is still in excess of the average 
fertility, 5, of the population. Indeed for only about 6 per cent. of the population 
with the very lowest values of the character does the fertility fall below some 
4 per cent. 

If we take WN as before for the number of pairs of parents, then the distribution 
of effective parentages is given by 
» 1 /x — *2169¢,\2 
SN 1 % ( ae 


= —— — é 2 
NV 2a ‘85630, 

We have now to consider the distribution of offspring from these effective 
parentages. We have to consider p of p. 260 in relation to cz and o;. Four 
hypotheses are possible : 

(i) We may suppose that the character is equally influential in the case 
of the fertility of both sexes; thus brain capacity and resulting intellectual energy 
might mark equal reduction of fertility in both sexes. In this case the proper 


value for « is* 
one, (B45), 
0} o>/ 


* Here as later it must be borne in mind that a constant factor is of no importance. 
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This gives at once a, 
Cz= 6,2 V1 +12, 


t: ——— 
Pez = 5 (Pat Pa)/VT Fe 


Here if the population were stable we should have o,=o;,, 1 and 2 representing 
as before mother and father. According to our present knowledge 1,,=1.,="4 to 
‘5, = say, ‘45, and r, lies between ‘15 and ‘25, say, ‘2. We have then 

Ox = a, X 15492 
and p,zz, = p of our formulae on pp. 260—1, = 5809. 

(ii) We may suppose that the character is equally influential with the fertility 

of both sexes, but that there is no assortative mating. In this case 7,,=0, the 


coefficients of = and es must still be equal and we have at once 
1 2 
Oz =0, x 14142, p= "6364. 

(iii) It is conceivable that the character affects the fertility in one sex only 
and that there is no assortative mating. In this case we may take «= &,, and if 
the population is stable a, = o,, while p = °45. 

(iv) The character may be supposed to affect one sex only, but there may be 
assortative mating. In this case, returning to our formula on p. 259, r,,=0 and 
az may be taken proportional to 


1 &, Tetis £, 
l=n2 0, l—#,3 oa,’ 
or we may write 
(é E 
t= 0; r; i 
o7; Os 
which leads to 
ae Vig — NreT oz 
Oy = Oj V1 = Pie's and Paz = ; 
Vl —1," 


or on the same assumptions as before : 
o,=0,x ‘9798, and p='3674. 

In all these cases o, and p are of course the variability of offspring for the 
given character and their correlation with the parentage character on the as- 
sumption of one child to the parentage. We now proceed to discuss the constants 
of the offspring distribution as resulting from effective parentages, i.e. the quantities 
Mm, =, and R of our p. 261. Here m, is the reduced mean value of the character, 

Constants of Offspring Distribution. Illustration I. 





| Hypothesis ... (i) (ii) (iii) (iv) 
| m/e, - “1260 1380 | -0976 0797 
| Se/o, ve 9539 9445 ‘9747 9818 


| Rip . | 8076 ‘9065 8805 | “8721 
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ie. the regression owing to differential fertility, =, is the variability of the offspring 
generation and should be compared with o, or o,, while R is the correlation 
between offspring and effective parentages. 


An examination of this table shows the following general results: (a) the mean 
value of the character in the population is reduced 8 to 13 per cent. of its standard 
deviation according to the hypothesis we make with regard to the influence of the 
character on fertility. That is to say, the mean character of the population stands 
one-third to a half-decile lower with each new generation. (b) In the next place 
the variability is lowered two to five per cent. The probability therefore of 
extreme deviations is sensibly reduced; but this reduction tells only on excess of 
the character because the mean of the distribution has been markedly shifted 
towards defect of the character; in other words, if extremes of the character 
in excess correspond to noteworthy men, they are much less likely to occur in 
a population where this differentiation of fertility has set in. Suppose we term 
one man in a thousand “noteworthy,” then such a man corresponds to the grade 
3090, in our original population. His position in the offspring generation will be 
3°216 
9539 

But this corresponds to four such individuals in 10,000, or 1 in 2500; or there 
will be less than half as many noteworthy men. This is on the first hypothesis ; 
the second hypothesis shows only 1 in 3300, and the fourth or less influential 
hypothesis 1 in 1700. It will therefore be obvious that if mental powers are 
hereditary, any differentiation of fertility will not only alter rapidly the average 
intelligence of the community, but will produce that dearth of noteworthy ability, 
or of genius, which is fatal to the continued existence under modern competitive 
conditions of a great national life. The whole system of curves corresponding 
to this first Illustration is given in Diagram I. 


Meo a 30900, = 32160, => = => 3:37 : 


Illustration II. In the previous illustration we considered a case where the 
maximum fertility did not coincide with the minimum of the character, although 
the lowest values of the character had markedly higher fertility than the highest. 
It will be of interest to determine what, if any, differences arise in our general 
conclusions, if we fix on the fertility of individuals high and low in the scale, 
instead of on average fertility values. We will suppose that the individual 
parentage which stands sixth in the 1000 for the given character has only a fertility 
of 1°5, and that which stands last but six has a fertility of 65; the maximum 
fertility of any grade shall be 7, and this shall occur below the grade of the 
individual parentage with a fertility of 65. Using the notation of p. 264, we may 
take m, = m, = 2°5, say. Hence 





2 (log 7 — log 1°35) 
as -= 9°5 = : 552 
. / 434,2945 5A + w= + 175525 


2 (log 7 — log 65)... Be et 
+ “ 434,2045 = 2°5X — w= + 38508. 
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Since X must be positive, these give us 
25X=°6851, w=1:0701, or 25X=1°0701, w='6851. 


In order that the maximum fertility may be outside the range — 250, to 2'5az, 
we must take the first pair of roots. We easily find 


o,=36490c,, k=3'9049c,, 
and from equation (xii) f= 42613. 


Hence the fertility distribution is 


_ 1 /x-3-90490,\2 
y=Te 2\ 36490c, 


The maximum fertility 7 occurs so far from the mean value of the character, 
that there are practically no parentages with a low value of the character and 
a low fertility, i.e. the number of individual parentages with an extremely low 
value of the character and a fertility under 7 is only 1 in the 20,000. We may 
therefore look upon our curve as one which sensibly describes a fertility constantly 
decreasing as the value of the character increases. 


Turning to the distribution of effective parentages we find : 
m, ='27280,, 2, ='96440,. 


Thus there is rather a greater shift than in Illustration I. of the average effective 


parentage, but the parentages are more variable. The distribution of effective 
parentages is: 


w 


4:6 T 1 /x — *2728¢,,\? 
ree 42610 ee > ( 96440, ) 
V 2a x ‘96440, 


The number of effective parentages with low values of the character and fertility 
less than the maximum will be found from determining the area of this w curve 
for values of # greater than #=3'90490, —‘2728¢, = 3'6321¢,, or by looking 
up in the Tables 2 = 3°6321/9644 = 3:77; this gives 1 in 12,500, a negligible 
frequency. 


In view of our previous results for Illustration I, it may be of interest to note 


the average fertilities of the top and bottom deciles of our individual parentages, 
they are to be found from : 


MF = | ie [ _ hada’ 
N ft / Qa 
where » h—m, _ + 12816 — 2728 

ak Sie. ne, 
whence by aid of the tables we find 2°28 for the average fertility of the top decile 
of the individual parentages, and 6°30 for the bottom decile. These values may 


be compared with the corresponding 2 and 5°34 of Illustration I. 
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We now turn to the distribution of offspring. We have: 
m,='2728p0,, L,=a,V1—-06986p*, 
R/p = 3°6490/V 143152 — p®. 


We can now determine these in precisely the same manner as they were tabled 
for the four hypotheses in Illustration I. 


Constants of Offspring Distribution. Illustration IT. 
—— a * cL 5d SPE ne Solr 





| Hypothesis ... ji) | (i) (ii) =| (iv) | 
oor -ee° aes 
| melo, .. | °1585 1736 | #1228 «| «= -1002 | 
| 3e/o, ... | 9881 ‘9858 ‘9929 | 9953 | 
| Rip | +9759 ‘9783 | -9713 9689 | 


We see at once from this table that whichever hypothesis we take the 
variability of the new generation is only lessened about 1 per cent. and the 
correlation about 3 per cent., values markedly less than in the case of Illustration 
I. This is obviously due to the fact that the fertility distribution is markedly 
less concentrated. But the shift of the population average is now 10 to 17 per 
cent. of the standard deviation, or roughly the average has receded through $ to 3 
of a decile in the character. We may measure the extreme effect of this by 
inquiring how far “ noteworthy” individuals, the individual one in a thousand of 
the old generation, have been reduced in number. What we have to find is the 
value of (m, + 3°090¢,)/>, ; for the first hypothesis this is 3°288, which corresponds 
to 1 in 2000, or the halving of these noteworthy individuals in the second 
generation, and we have practically the same rate on the second hypothesis. We 
thus see that on the numbers of this second Illustration, the average population 
will degenerate considerably faster, 25 per cent. faster approximately, but the 
dearth of noteworthy individuals will be slightly less. The reader must not, 
however, conclude from these numbers that our second Illustration represents 
a less undesirable type of degeneracy. Beside the dearth of the noteworthy, 
the multiplication of the extremely unfit has to be measured. Let us suppose 
that the individual whose character has the least value in the 1000 of the original 
population is by virtue of this deviation a non-useful member of the community— 
physically or mentally unfit. We must clearly then find out in the second 
generation to what number of the population this standard applies, Le. we want 
(3°090e, —m,)/=.. This is equal to 2967 on the first and 2°958 on the second 
hypothesis, corresponding in both cases to 1 in 667 instead of 1 in 1000. On the 
other hand it will be found that in the first illustration the corresponding numbers 
are 3°107 and 3125 on hypotheses (i) and (ii) respectively, giving practically 1 in 
1000, and the same standard of unfitness repeating itself in the second generation. 
Thus Illustration II, with a slightly reduced rate of decrease of noteworthy 
individuals, has an increased rate of unfit individuals, when compared with 
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Illustration I, and a general or average decadence greater by 25 per cent. It 
would be idle to assert that one or other corresponds to a greater racial degeneracy 
until we have settled whether it is*more important to a nation to have in the case 
of a desirable character (i) a high average value, (ii) a larger percentage of note- 
worthy individuals, or (iii) a smaller percentage of unfit individuals. We do not 
know at present what relative weight is to be given to these categories ; but we 
can see that in any quantitative theory of the socially desirable characters in a 
community, they naturally arise from the analysis of a differential fertility. Such 
factors as dearth of leaders, lowered general intelligence of the community, multi- 
plication of the unfit, are seen, even by a preliminary analysis of this kind, noi 
to be unrelated causes, by aid of which the historian accounts in a vague manner 
for racial and political changes. They are associated factors, explicable and 
measurable when we know the outlines of a theory of differential fertility. 


No special stress must be laid on the particular numbers chosen above to 
illustrate this discussion. They are round numbers not wholly inappropriate, 
perhaps, to what we know of intelligence and fertility in man. They are used to 
suggest the main outlines of a discussion of such problems. Considering the 
immense masses of statistical, especially demographic data now being accumulated 
in all civilised countries, it seems to me that the historian of the future will have 
the means of testing in a way, never yet feasible, the changing fitness or unfitness 
of nations. In that future absolute and differential fertility, the abundance or 
dearth of leaders, the average intelligence of a nation, the reduction or multiplication 
of its unfit, will be recognised as the basal factors in racial progress, the biological 
sources of evolution in political history. Then history becoming biological will for 
the first time be admitted as a branch of scientific inquiry, and will also for the 
first time provide the requisite training for the statesman. 


If we turn to examine the possibilities of immediate work with regard to 
differential fertility, there are several directions in which investigations can at 
present be carried out with reasonable hope of success. If we confine ourselves to 
the artizan class, wages are a very reasonable measure of capacity, and the corre- 
lation between these and fertility or fecundity* might fairly easily be ascertained. 
Intelligence might be measured by craft classification extending from engineer to 
general labourer. Much may be done, if we pass from the artizan class alone, by 
comparing the size of families of members of various professions, of those following 
mercantile pursuits, of retail traders and of artizans. Further within a narrow 
class like the graduates of the Universities, it ought to be feasible actually to 
tabulate fertility against a scale of intelligence as measured by academic distinction. 
Fertility of sane and insane stocks, of mentally defective and normal stocks, of 
alcoholic and sober parentages can be ascertained, although the task, of course, needs 
careful social inquirers. Lastly grades of physical fitness might well be investi- 


* Fertility might be measured by nett or gross family when complete, or for marriages of 15 or 20 
years’ duration. It would, I think, be possible to measure fecundity by some function of the intervals 
between the births of the first three or four children. 
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gated ; the family of the athletic might be compared with that of the non-athletic ; 
and those of the smith and the navvy with those of the cobbler, tailor, and street 
hawker. It would, I think, be quite possible even without a reformed census, for 
which every sociologist should work, to obtain many results of value bearing on 
differential fertility by inquiries of the above nature. 

Diagrams. 

Each diagram consists of five normal curves. They are worked out on 
Hypothesis (i), ie. that the fertility of the parentage is equally affected by the 
character in both parents, and that there is an assortative mating in the community. 
The coefficient of parental heredity is taken as “45 and that of assortative mating 
as ‘20. w is measured from the mean of the character in the parentage of the first 
generation. Expressing everything in terms of o, the variability, say, of males in 
the first generation we have the following curves : 

Diagram I. Representing Illustration I. 


Curve I. Distribution of Males in first generation : 


‘ 1 x 
hee se » 2op 
y= ———— e 2a’,” 
y V2 o, 
Curve IL. Distribution of Individual Parentages in first generation: 
1 x? 
n 5 ae 
y = ————e 2 (1°550))? | 


NV 2 1'550, 


Curve III. Curve of Fertility of Individual Parentages of first generation: 


. 1 /x- 1:26030,\2 
y= 6439e 2\ 256970, ) 

(N.B. Only part of this curve is drawn.) 

Curve IV. Distribution of Effective Parentages in first generation: 


Bh : | (“5 "33620, \2 
y=- e  2\ 182780, 
NV 2ar 132730, 


Curve V. Distribution of Males in second generation : 
ea = 1 (* store) 
y= - e 2 95390, c 
NV 2er ‘95390, 
Diagram II. Representing Illustration II. 
Curve I. Distribution of Males in first generation : 


. 1 a? 
y= = e 3, 
N 2qr a, 


Curve II. Distribution of Individual Parentages in first generation : 
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Curve III. Curve of Fertility of Individual Parentages of first generation: 


1 (Sree. — 6°05260; 
y= Te 5°65600; ) , 
(N.B. Only one half of this curve is drawn.) 
Curve IV. Distribution of Effective Parentages in first generation: 
_ Fe “42280, 
1-49480, ) ; 
Curve V. Distribution of Males in second generation : 


1 /x —*15850,\2 
4: ‘261n ae ( ‘98810; ) 
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A STATISTICAL STUDY IN CANCER DEATH-RATES. 


By G. D. MAYNARD, F.R.O.8.E., Pretoria. 


As there is still much that remains obscure in the aetiology of Cancer, any 
new light that can be thrown on this important subject seems worthy of careful 
consideration. 


It is with a sense of great diffidence that I bring these figures forward, knowing 
full well how liable to error death-rates may be, and how easy it is to draw false 
conclusions from the study of figures. But when all is said and done it is to 
figures that the final appeal must be made, and if the death-returns are so 
unreliable as to be worthless, then it is certain that there are no more reliable data 
to be obtained. The comparatively few cases that come within any individual 
experience are, from the point of view of exact knowledge, of little or no value. 


If therefore we are to learn anything from the history of the past, it is to 
figures that we must go, guardedly and with great caution, bringing all the 
collateral evidence that we can obtain, to check and control our results, and above 
all using accurate methods of analysis before drawing any conclusions. 


Theories are frequently being advanced, in medical literature, in conflict with 
the records, these either being ignored, or set aside with the remark that they are 
so unreliable that no attention should be paid to them. Marked divergencies 
in the rates are dismissed as being due to errors in registration, the increasing 
cancer death-rate is regarded as being due to better diagnosis, and so on. 


Before entering on any discussion as to the causes of the varying cancer death- 
rates as observed in different countries—and even in the cities and towns of the 
same country—it will be advisable to make a few general remarks on how the data 
in this paper have been obtained and dealt with, and to answer a few of the 
objections that may be raised as to the value of the returns themselves. 


At the outset one must admit that many imperfections are inherent in all 
statistics of death which are not based on post-mortem examinations. But because 
the figures are not absolutely correct are we to neglect them, or regard them as 
absolutely useless? One must remember that where large masses of figures are 
being dealt with the errors in excess tend to balance those in defect, and wrong 
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inclusions in a group must be set against the accidental exclusions. For instance, 
is it reasonable to consider that in two cities of the same country, during the same 
period, and under similar Registration Laws, rates of 1184 and 446 per million 
are merely fortuitous or due to errors of registration? Such a suggestion means 
that either more than half the cases are overlooked in the one town or more than 
100°/, too many are recorded in the other, unless the error is to be divided 
between them. Before dismissing all differences in rates as being due to errors in 


diagnosis or imperfections of record it is well to realise exactly what this suggestion 
entails. 


Not only is it often said that better registration accounts largely for the varying 
rates in different places, but that much of the increase in the cancer death-rate, 
as also that occurring in the diabetes death-rate, is due to better diagnosis. The 
statistics show that both these diseases are very markedly on the increase. As 
a rule before death occurs both diseases give a very definite clinical picture, 
and run a more or less protracted course. When dealing with death returns, 
difficulties in early diagnosis are of no consequence. ‘These factors are in favour of 
a correct diagnosis being made, and probably fewer mistakes are made in the 
diagnosis of these groups than in almost any others. In the last 30 years there 
has been very little added to our knowledge, that would aid the general practitioner 
to a more accurate diagnosis, in the final stages of these diseases. Doubtful 
tumours are more often submitted to the microscope for confirmation of diagnosis 
than formerly, but the effect of this on the returns would rather be to exclude 
from the cancer group cases that would otherwise have been found there; for 
a fatal tumour of doubtful origin would most probably have been classed as cancer 
in the absence of definite knowledge to the contrary. There does not seem 
therefore, sufficient ground for believing that the increase in the rates is in any 
great part due to improved medical knowledge, or that this will account for the 
growth in the cancer and diabetes death-rates. 


If we refer to the cancer death-rates for “occupation groups” we find that 
those consisting of men who by their social status can best afford highly skilled 
medical care, or those who have State medical attendance provided, are the groups 
showing the lowest death-rates, while the labouring and agricultural classes show 
distinctly higher rates. This is the reverse of what one would be led to expect if 
the explanation that better diagnosis accounts for the increased rate—is the correct 
one. 


There is still a further reason for thinking that the group returned as cancer is 
fairly homogeneous in character. It will be seen from Fig. 1, how good a fit a 
Pearson Curve gives for the male death-rates, tabulated in five-yearly periods. 
The dotted polygons at ages 92°5 and 97°5 are the original observations and the 
constants of this curve are given below. Sheppard’s adjustments were not used as 
I did not expect to find high contact at both ends. Finding however that there 
was high contact I recalculated the curve, using Sheppard’s adjustments and 
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altering the two groups referred to above, as indicated in the diagram. As these 
groups are based on very small numbers, this step seemed justifiable. The general 
fit is slightly improved thereby*, although the form of the curve is not altered. 
In Fig. 2, the female curve is given. Here the observations are more irregular, 
which may possibly be due to the greater difficulty in obtaining correct ages in the 
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Fic. 1. Cancer Death-Rate, Males, U.S.A., 1900. 
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case of women. Sheppard’s adjustments were used, although the result shows 
that we are not justified in supposing high contact. The total number of cases 
employed in finding the rates for the various age periods was male deaths 11,436, 
female deaths 18,039. The areas of the frequency polygons represent the number 
of cancer deaths per million of population living at each age. 


* That the theoretical start of the modified curve falls so long before birth is, perhaps, due to the 
fact that there is really not high contact at the old age end after modification. 
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The constants of the curves are :-— 
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Cancer Death-Rate, Females, U.S.A., 1900. 
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Readers who have fitted curves to “disease” distributions grouped in this 
manner will I think agree, that a fit as good as that given in Fig. 1 indicates the 
homogeneous nature of this group. I have frequently been struck with the 
excellent fit given by diseases, like puerperal septicaemia, where the chance of 
a wrong diagnosis is unlikely, and greatly disappointed with the bad fit obtained 
by a disease that presents difficulty in diagnosis, such for instance as enteric fever 
in hot climates, where the heterogeneous character of the group is undoubted. 
The excellence of fit in the case of the male cancer curve is I think evidence of 
some value as to the homogeneity of these returns. 


The goodness of fit as measured by the usual y* formula* for the modified 
male curve is P = ‘4 and excluding the irregular groups at 2°5 and 107°5, P ="73. 

The curve in the case of the diabetes distribution is not satisfactory. No 
single curve can be fitted to the figures which will even approximately represent 
the frequency polygon. It seemed advisable to see if two or more curves could be 
found that would serve to express the recorded rates. It is fairly easy to see that 
these curves cannot all be normal. The old age component of diabetes is clearly 
skew, and it was not hard to obtain a good representation of this part of the 
mortality. The earlier mortality appears to consist of two components ; a diabetes 
of youth with a mean age of 15 to 20, and a diabetes of middle age with a mean 
age of about 40. These components would agree to some extent with those found 
by Pearson in his resolution of the general mortality curvet. The discussion of 
this point has been reserved for another paper. Further figures are needful before 
it can be definitely stated that the disease now known as Diabetes melitus can be 
certainly resolved into two or more age groups. The English death returns 


for this disease show the same general features as those obtained from the United 
Statest. 


Before any comparisons of the cancer death-rates can be profitably made it is 
necessary that they should be corrected for the age constitution of the population. 
Cancer is mainly a disease of advanced life, and as this is the portion of the 
population most unevenly distributed between town and country, the difference in 
crude death-rates due to this cause must be borne in mind when rates are being 
compared. Similarly no useful comparisons can be made between the rates as 
found for the different occupations until corrected for age. The group of bankers 
for instance has a very different age composition from that of clerks, as the 
following correction factors show: bankers=0'3516: clerks=1'3157. Thus the 
crude rates of 414 and 280 become with correction 150 and 368 respectively. 

The usual method § of obtaining the correction factor has been adopted, and 

* Biometrika, Vol. 1. p. 155. 


+ Phil. Trans, Vol. 186, p. 407. 
t The female diabetes frequency polygon exhibits the same features as the male distribution, the 
The early age portions of both cancer and diabetes frequencies require special 
consideration, which I hope to give on another occasion. 
§ R=death-rate in standard population, 7;, r2...7,,=death-rates at age periods, x, v2... x, =numbers 
of population in similar age groups, for district to be corrected, where X is total population. Then 
if R’=S (xr)/X, R/R’ is the correction factor. 


rates being similar. 
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the method by which the standard death-rate used was obtained is given in the 
Supplement to the 65th Annual Report of the Registrar-General, for the years 
1891—1900. The standard population is that of England and Wales for the 
same period. It is immaterial what standard is adopted so lung as all crude rates 
are reduced to the same base, as they can then be compared inter se, although 
they do not represent the actual rate obtaining in the district considered. 


The only source at my disposal from which it was possible to obtain the 
necessary data to correct the rates for towns and districts, was the United States 
Census of 1900. Fortunately this country covers a large area, and is subject 
to many varying influences of climate and social conditions. It is thus a 
particularly suitable country in which to study the conditions which may pre- 
dispose to the development of this disease. Unfortunately, however, the whole 
country is not under a uniform registration law; the field is therefore restricted to 
those states and cities whichare. The statistics from these Registration States and 
Cities are probably fairly accurate; and I think the marked differences in rates 
that are observed, cannot be due to faults of registration, and point to real 
differences in predisposing causes. As a check on the accuracy of the death 
returns the Census Department made an enumeration of the deaths occurring in 
the year ending May 31st, 1900. This was carried out in all districts where there 
was any doubt as to the state of the registration. The results show that so far as 
the cities dealt with in this paper are concerned, the error in no case exceeds 
95 °/,. In only one city was the error greater than 6°5 °/,, while the mean error 
is 3'°5 °/, with a coefficient of variation of 2°225 °/,. 

In addition to the correction for age, a sex correction might also have been 
introduced. In towns of over 100,000 inhabitants—and none with a smaller 
population than this have been used in this paper—the variation in sex distribution 
is not, from our present point of view, of very great importance, while the extra 
arithmetical labour involved in calculating over 150 further correction factors 
is very considerable. In a few cases where the sex distribution seemed to vary 
considerably from the usual proportions, I calculated the sex correction factor, but 
in no case did the alteration in the corrected rate amount to 0°5 °/,. It should be 
remembered that the cancer death-rate is highest for females, while the diabetes 
death-rate is highest for males, thus any error resulting from this omission would 
tend to lower the correlation that exists between these diseases. 


In dealing with large areas of mixed urban and rural populations another 
source of disturbance must be borne in mind, viz. the different age distributions in 
the two districts, with its effect on the death-rates in each. The correction factor 
for the rural population of the ten Registration States in 1900 is for cancer *7585, 
while for the urban areas of the same States it is 10617. Now as the proportion 
of the population in the urban and rural districts differs in each State, this in 
itself will cause an appreciable difference in the observed rates. The following 
table shows the change in rate that would be made from this cause alone, were the 
age distribution of rural and urban districts uniform for each State. 
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Cancer Death-Rates. 











| | 

State | Crude Corrected | 

| | 

| | | 

Maine ... ‘as rie h 837 692 

New Hampshire... | 668 568 

Vermont s se5 818 647 
Massachusetts nce 664 648 

Rhode Island om 623 590 | 

New York ... ae 612 587 
Connecticut ... ve 611 563 
New Jersey ... ea 504 472 
Michigan ie ac 580 498 





It will be noted that this correction although lowering al! the rates does not 
materially disturb their order. 


The correction factor as used for the various States is based only on the age 
distribution of the whole State without reference to the proportions in the urban 
and rural areas. The necessary figures to make this latter correction were un- 
obtainable. 


Throughout the paper the correction factors are based on the census of 1900 
while the death-rates of the cities are those of the five-year group 1900—1904; it 
being assumed that the age distribution has kept fairly uniform, or at least has 
not altered sufficiently to affect materially the conclusions arrived at. In the case 
of the States the 1906 death-rate is compared with the insanity figures for 1904. 
This was unavoidable as no others were obtainable. I do not suppose however 
that the rates changed so greatly in two years that all value from these com- 
parisons is lost. 


In calculating the coefficients of correlation and their probable errors a doubt 
arose as to whether it was right to give every city the same weight. Should a city 
with three million inhabitants be given greater weight than one of only 100,000 ? 
If it were proved that any factor, such for instance as a meteorological condition, 
determined the incidence of cancer, it is clear that weighting for size would be 
wrong. Again if overcrowding was a marked factor it would be unjustifiable 
to weight for size. To take 100,000 inhabitants as a unit was quite arbitrary ; 
I have therefore regarded each city or State as aunit. This reduces to a minimum 
the value of the denominator in the probable error, which is therefore of maxi- 
mum size. 


When first approaching this problem I was of opinion that in spite of the trend 
of modern research it might yet be demonstrated that cancer was of infectious 
origin, either bacterial or protozoic in nature. Many infectious diseases have 
a very marked seasonal incidence, and are in certain cases closely correlated with 
various meteorological conditions. That this is so in respect to enteric fever in the 
Transvaal I have shown elsewhere, and it is I think reasonable to suppose that if 
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an organism has an existence apart from its human host it is quite likely that this 
should be so. Cancer being a disease of fairly long duration, and of uncertain date 
of onset in relation to the time of death, there was no way of finding out whether 
the disease was contracted more frequently at one time of the year than another. 
If therefore meteorological conditions have an influence on causation, it must 
be sought in place rather than in time. For this purpose I have obtained from 
the U.S.A. Government publication Climatology of the United States, average 
meteorological conditions for 27 Registration cities with populations of over 
100,000 inhabitants. 


Corrected Cancer Death-Rate, 27 Cities, 1900—1904. 


Sunshine in hours 

Mean Temperature ens 
Mean Maximum Temperature 
Mean Minimum Temperature 
Rainfall 

Days of Rain ... aa 
Absolute Humidity ... 


Cancer Death-Rate, for 


r= —°0142 + °1323 
r= —'1478+°1294 
r= — °2236 +°1257 
r= — 0148+ °1323 
r= —°2517+°1239 
r= —°1621+°1288 
r= —'2014+°1269 


the 17 Cities. 








Sunshine 
Rainfall 
Mean Temperature ea 
Mean Maximum Temperature 
Mean Minimum Temperature 


r= *1268+°1659 
r= 31634-1518 
r= —°0732 +°1685 
r= —*1509+°1600 
r= 19544-1659 


Rel. Humidity 8 a.m. .- r= 3101+°1519 
Rel. Humidity 8 p.m. . t= *°1603+°1600 
Range of Humidity ... we «6 0724471685 


It will be observed that out of the seven correlations appearing in the table 
none of the values are really significant when compared with their probable errors. 
I had previously calculated the correlations for 17 of the larger cities and values 
of the same numerical order were obtained, but in many cases the sign was 
different. They are appended for the purpose of comparison. These values must 
therefore be considered as zero. 


It will be seen from these figures that so far as 27 cities of the U.S.A. are 
concerned there is no correlation between the average climatic differences obtaining 
in the city and the prevalence of cancer, as measured by its average death-rate 
over the five years 1900—1904. 


It has frequently been stated that cancer is less prevalent in hot climates, and 
there are facts to be obtained in support of this general statement. It is not how- 
ever necessary to assume that this is due directly to climate, and another explanation 
is, I think, probably the right one. Marked differences of race, temperament, and 
pressure of life are to be found in tropical and sub-tropical countries, as compared 
with temperate and cold ones; the exception to this being the case now under 
discussion, the United States of America, 
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So far then as these figures go we have no evidence to connect meteorological 
factors with the variable incidence of cancer. This, although a negative result, is 
not, I think, without its importance. 

Finding that the theory of the infectious origin of this disease gained no 
support from the values thus discovered, it occurred to me to correlate the cancer 
rates with those of some other disease. For this purpose I chose diabetes, and for 
the following reasons. (1) Both diseases have very much the same age distri- 
bution. (2) They stand almost alone as being on the increase, while other causes 
of death show declining rates. (3) The aetiology of both diseases is obscure. 
(4) Both being diseases of old age, the heredity factor cannot have been increased 
by the results of modern medical skill, for the prolongation of life that might thus 
have been procured would have been mainly at a period when the procreative 
power was passed. In fact the tendency of modern times is rather to postpone 
marriage to a later period of life, and this might be expected to reduce any 
heredity factor there may be. (5) If there were a common factor in the causation 
of the dual increase a correlation between these diseases might be discovered. 

There seemed then sufficient justification for undertaking the labour of 
calculating correction factors for the diabetes rates. As was to be expected both 
correction factors are similar in value. 

Professor Pearson has shown that when correlation tables are formed between 
rates, in such a way that a common factor occurs in both the variables, a “ spurious 
correlation” is obtained. In this case therefore the usual formula 7 = S(«y)/Nozo, 
can no longer be employed. On his advice* I have therefore used the following 
formula when dealing with the correlation as existing between rates: 


Vay — VeaeN ey 
Va- 17) (1 — ray) 

Here «=the number of deaths from one disease, y= the number from a second, 
and z= the number of individuals in the district. Throughout this paper where 
the symbol p occurs it refers to values found by this partial correlation formula. 

In dealing with diseases like cancer and diabetes, a significant correlation 
will probably occur when uncorrected deaths are used, due to their similar age 
incidence. This will be most marked in the value found for the races, and 
occupation groups, and least so for the cities. This will be seen by comparing 


the “Coefficient of Variability” for the cancer correction factors for the different 
groups : 


Pay = 


Cities V. = 11°46 °/.. 
States V, =1457 °/, 
Races V-=dede /.. 


28 Occupation Groups V, = 43°86 °/.. 


* [This formula gets rid of the ‘‘ spurious correlation” due to forming ‘‘ rates,” i.e. the common 


population totals, but it does not of course get over the high spurious correlation of the so-called 
‘*age correction factors.” Ep.] 
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As, therefore, using corrected deaths introduces “spurious correlation” into the 
value of p, and as using uncorrected deaths will introduce a correlation due to 
the age factor, the values of p calculated, both with and without the use of a 
correction factor, are given*. The symbol p’ will be used to denote values found 
from corrected deaths. The necessity for correcting for age is not of such im- 
portance when comparing cancer with suicides or insanity. 


The partial correlation coefficients found for cancer and diabetes are as 
follows : 
40 Cities, 1900—1904, p =°6896 + 0559, p’ = "7325 + 0494. 
15 States, 1906, p= "9088 + 0303, p’ = "8258 + 0554. 


These values were sufficiently striking to lead one to further consider the 
matter. 


From figures obtained from Vital Statistics (Vol. 1.) U.S.A. Census, 1900, 
I calculated correction factors for the cancer and diabetes death-rates of the various 
races residing in the Registration States, and classified according to birth-place 
of mother. The correlation thus obtained for these two diseases is certainly 
significant : 
p='8609 + 0552, p’='5442 + -1501. 


That these values are not due to errors of random sampling is shown by 
their probable errors, for the odds are some millions to one against such an 
occurrence. Nor does it seem likely, that they are entirely due to “spurious 
correlation” in the p”’s and to an entirely different source in the case of the p’s. 
To what then are these correlations due? It seemed possible that the agree- 
meu of rate might be due to certain cities having more efficient registration, 
although the remarks of the Registrar did not lend much support to this theory. 
He writes: “The ‘registration area’—that is to say, the States having laws, the 
results of whose operation have been accepted as giving practically complete 
mortality returns, together with the cities in non-registration States where deaths 
are satisfactorily registered under local authority—remains substantially the same 
from 1900 to 1905. The geographic distribution is shown in the accompanying 
map” (Mortality Statistics 1905, U.S.A.: see our Fig. 3). 


I have already referred to the special returns of deaths made by the Census 
Department in 1900. The small value of the Coefficient of Variation (2225 °/,) 
in the “percentage error of registration” is I think sufficient evidence that the 
correlation values are not due to errors in registration. If these high correlations 
were due to varying efficiency in registration, then other diseases might be 
expected to show a similar value when correlated with cancer. It will be seen 
further on that there is no significant correlation between cancer or diabetes 


* [I believe I have found a satisfactory method of making both age and population corrections free 
from spurious correlation, This method will shortly be published. Ep.] 


Biometrika vir 37 











286 A Statistical Study in Cancer Death-Rates 


and diseases of “ill-defined or uncertain cause.” Probably one of the most suit- 
able diseases as a control would be nephritis, were it not that its age frequency 
is so similar to cancer, thus the age factor will give rise to errors to an even 
greater extent than occurs in the case of diabetes. 








FLO 
Registration States, 1900. 
United States of America RNG ” added, 1906. 
° Registration Cities dealt with in this paper. 
Fic. 3. 


The values of p and p’ are given for this disease as they may serve as a useful 
contrast to the other correlation values. In calculating the value of p’ the cancer 
correction factor was used for the nephritis deaths as the age frequencies of the two 
diseases are not very dissimilar. Cancer and Nephritis, 40 Cities: 


p ='3792 +0913, p’="1571 + 1040. 


These values are of a quite different order from those found for cancer and 
diabetes, 
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Unfortunately no figures were obtainable to allow of a correction being made 
for those deaths which occurred in hospital and did not belong to the district. 
Although this correction is most.desirable, yet it is not likely to very greatly 
affect the correlation values for cancer and diabetes, as the latter disease is not 
one that bulks very largely in hospital deaths. It is not a disease very prevalent 
among the class of patient that frequent hospital, and from its nature would 
not be admitted unless combined with some acute affection. This of course will 
occur in a certain number of cases but to a small extent as compared with cancer. 
This objection will not apply to the returns from the States, and all the cities 


here dealt with, except a few eastern ones, will draw from fairly large rural 
areas, 


Before coming to the conclusion that the correlation found between cancer and 
diabetes is of real significance, one must as far as possible exclude all sources 
of error that might lead to false deductions. The question of the inaccuracy 
in the records from incomplete registration has already been discussed, but there 
is another form of inaccuracy that must be considered, viz. that arising from want 
of care and knowledge on the part of the certifying medical practitioner. In 
Mortality Statistics U.S.A. 1907, the following remark occurs: “In Table IV. may 
be found two columns headed ‘ill-defined causes’ and ‘cause unknown’ which are 
of no special significance in themselves, but which are nevertheless important, 
because they show the margin of understatement that must be considered in 
examining the rates for more definitely stated causes of death.” It seemed 
therefore desirable to correlate cancer and diabetes death-rates with a five-yearly 
average rate of these groups, for the 40 cities occurring in the other tables. The 
values found are as follows: 


Cancer and diseases of ill-defined and unknown cause, p='1020 +°1055. 


Diabetes and diseases of ill-defined and unknown cause, p=°0773 +°1060. 


It does not seem likely therefore that any appreciable correlation will have arisen 
from this cause. 


It has been suggested to me that in towns well equipped with competent 
medical men these diseases would figure less in the death returns, because by 
means of operative treatment in the case of cancer, and dietetic treatment in the 
case of diabetes, death might eventually be due to some other cause; whereas 
where medical skill was inferior, or the population not sufficiently educated to take 
suitable precautions in regard to diet, or submit to operation, a larger number 
of cases of both diseases would appear in the records. In the first place I do not 
suppose that the average medical skill would vary very greatly in cities of the size 
dealt with in this paper. It is no doubt true that in both diseases, life may be 
prolonged by skilful treatment but a cure is not effected, except in very rare 
instances. When death occurs from an intercurrent disease the older complaint is 
included on the death certificate. At least this should be done and it is less likely 
to be omitted where the medical skill is of a high order. In classifying causes of 


37—2 











288 A Statistical Study in Cancer Death- Rates 


death it is the rule to select the diseases ordinarily known as constitutional 
in preference to those known as local diseases. This will be made clear by an 
extract from the New Tables issued by the L.G.B. and the Schedules of causes of 
death : “Thus Cancer should be selected in preference to Pneumonia, and Diabetes 
in preference to Heart-disease.” Further the groups already dealt with, viz. 
“Tll-defined and unknown causes” are, perhaps, the best statistical guide to the 
efficiency of the local medical profession. 


Professor Pearson has kindly drawn my attention to another possible source of 
error. If it were shown that both diseases were more likely to attack persons in 
the same social class, or engaged in the same occupations, an artificial correlation 
might then be set up, owing to the inhabitants of the different cities being 
engaged in occupations of a diverse nature. It will be seen later that cancer and 
diabetes are probably negatively correlated in respect to the occupational groups. 
There is I think no doubt that diabetes is a disease especially prone to attack the 
wealthy and sedentary classes, but in the U.S.A. this does not seem to be true of 
cancer. This question is of importance because Mr Heron has by a study of the 
London statistics arrived, by indirect methods, at an opposite conclusion. From 
the U.S.A. Census Reports it was possible to obtain the numbers of men employed 
in various occupations, together with their age distribution, and the cancer and 
diabetes death-rates. These are given first in eight inclusive groups and then 
in sub-groups. For the present purpose correction factors for the death-rates 
were calculated for the eight large groups and for 28 sub-groups as shown in 
the table. The various occupations were then arranged in order according to their 
social status, but as it was unlikely that two people would place them in exactly 
the same order I asked a friend to rearrange them for me and the results of 
both orders are given. The correlation was found by the method of ranks as given 


by Pearson*, 7 =2sin F Pus where py=1—6S8(»,—»v,)/N (N?—1). Correlating 
high death-rates with superior social occupations the values are as follows: 


Eight Inclusive Groups. 
Cancer and Occupations r =—'5176 + 1746. 
Diabetes and Occupations r= 9374 + ‘0289. 


28 Sub-groups. 
1st grouping 2nd grouping 
Cancer and Occupations r=—'4574+4°1007, r=—'3680 +1102. 
Diabetes and Occupations r= ‘67164°0700, r= ‘6748 + 0694. 
To check the results thus found four “four-fold tables” were constructed, the 
division of the death-rates being taken at their mean, and the occupations being 
divided at Nos. 9 and 17 of the order as shown in the table. 


* “On Further Methods of Measuring Correlation,” Drapers’ Research Memoirs, Dulau and Co. 
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There seems to be no doubt that in America there is a negative correlation 
between cancer and better social class, as measured by occupation, while there is a 
high positive correlation between diabetes and better social conditions. Thus 


Cancer and Occupation. 
Death-rate. 





Diabetes and Occupation. 
Death-rate. 


























H. L. H. L. H. L HL 
Siete 5 | . | ah re 4 —_— a a 
me 7| 9| | 5112/17 High | 9| 0 9 lau} 6|a7 
Social Social 
Status Status 
Low | 11; 8/19 8| 3/11 | Low | 3/16 | 19 | 1/10} 11 
pl Bik Se eae |/—|—|— oe 
13 | 15 | 28 | | 13 | 15 | 28 12 16 | 28 12 | 16 | 28 
Sok Ee oe at a gees a ier 
=-—'56+°09, r=—'63 +08. r= 10, r= °82 + 04, 


there does not seem any reason for supposing that an artificial correlation will 
have arisen between cancer and diabetes death-rates owing to prevalence of 
varying occupations in the different cities. 


If the increased rates observed in the cases of cancer and diabetes were due to 
a common cause then it is probable that their rates of growth will be found to be 
fairly highly correlated. Not being able to obtain rates for a sufficient period of 
time, from the United States reports, the rates given in the Registrar’s Report for 
England and Wales, in five-yearly groups, for the 35 years 1871 to 1905 were used, 
and here p=°8060 + 0893. This high correlation shows I think a strong proba- 
bility that there is a common factor influencing the increase of both diseases. 
That this value is explicable on the assumption that the increased rates are 
merely apparent and due to more careful diagnosis is in view of the facts already 
mentioned almost inconceivable. It is equally clear that the increase is not due 
to alteration in the age constitution of the population. For example had the same 
cancer death-rate been operating in 1851—60 as was so acting from 1891—00 the 
death-rate would have been 743 per million, and in 1871—80, 749, while in 1891—00 
it was 758. Thus in 50 years the increased ratio of old people would have led to 
a gain in rate of only 15 deaths per million of the population, whereas the recorded 
increase in the last 35 years 1871—05 is 534 per million. It has been previously 
pointed out, that no great increase in our knowledge of either disease has recently 


been made, that would aid the general practitioner in more accurate diagnosis in 
the final stages of the disease. 


If then we have succeeded in excluding artificial sources of correlation, it 
remains to find, if possible, the factor that is producing the increasing rates in 
these two diseases, and giving rise to their correlation in the various cities and 
districts. 


Only one cause, it seems to me, will fit the facts as we know them, viz.: the 
pressure of modern civilisation and the strain of modern competition, or some 
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factor closely associated with these. Radio-activity and various electric phenomena 
have from time to time been accused of producing cancer. The increased use of 
high tension currents is an undoubted fact in modern city life, but a glance at the 
varying rates as applying to different occupations will suffice to show that the 
explanation does not lie here. Another marked feature of modern life is the 
increasing use of preserved foods. But again the death-rate as evidenced by the 
occupation groups seems to negative this as a probable cause. For instance the 
agricultural rate is higher than that for the military and police. 


The increasing pressure of competition with its attendant nervous strain 
appears to be the factor in the aetiology of both diseases that will account for the 
phenomena. It has long been recognised that nervous shock and worry may be a 
determining cause in diabetes. 


At once another disease becomes associated with this factor in modern life. 
Insanity, also on the increase, should show a positive correlation with cancer. 


Unfortunately figures of the insane rates for the various cities in the United 
States are not obtainable, so on the suggestion of Dr Troup I have correlated 
suicides as being the nearest approach to a measure of the prevalence of insanity. 
Doubtless suicides form some guide to the stress and strain of life in the various 
districts, but other factors have probably also an effect in determining the prevalence 
of suicide. Religious beliefs, and moral training, may have a marked influence. 
The correlation found for cancer death-rates and suicides for the 40 cities, for the 
five-year groups, is: 

p = ‘3962 + 0899, 
and the 15 States for the year 1906: 

p= '4412 + 1402, 
and for the Races according to birth-place of mother, 1900: 

p = 6010 + 1362. 
Although these values are not high they are possibly significant ; and in the case 
of the races of the same order as that found to exist between the insane enume- 
rated in hospital and suicides for the 15 States which is: 

p = "6062 + °1101. 
In the case of correlations found for the States and Races, it must be remembered 
that much greater fluctuations may be expected owing to the values being calcu- 
lated on the returns for only one year, while in the case of the Cities it was possible 
to obtain a steadier result by averaging the returns for five years. 

From the special report on The Insane and Feeble Minded in U.S.A., published 
by the Census Office and dealing with the returns of 1904, I was able to obtain 
figures for the insane as they apply to the various States. 


Particular caution is required in dealing with these figures, and a few extracts 
from the introduction to the report may enable the reader to estimate better the 
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true value of the results. The report deals only with the insane in institutions. 
“No institution for the insane, except a very few small private institutions, refused 
to give information. Therefore so far as all classes of institutions are concerned 
the present enumeration is practically complete,” p. 6. “To offer any estimate of 
the probable number of the insane that would have been disclosed by an investi- 
gation which included those not in institutions in addition to those in hospital on 
31/12/03 would be a mere guess. But it is certainly within the truth to say that 
there were as many on that date as were found outside of hospitals in 1890, some 
30,000,” p. 8. This number refers to the whole of the United States and not alone 
to the 15 States dealt with in this paper. “In many cases the high rank given 
some States, for instance, Massachusetts, New York, Connecticut, New Jersey, 
California, &c., probably approximate actual conditions,” p. 4. 


Two correlation tables were prepared so that the matter might be viewed in 
somewhat different lights, and as much information as possible obtained from 
the figures. It seemed probable that yearly admission might be an even better 
guide to the prevalence of insanity than the actual number in hospital on any 
given day; because where accommodation was insufficient for the pressure 
occasioned by cases waiting for admission, it would be possible to mitigate this 
by discharging chronic and convalescent patients somewhat earlier than would be 
done by hospital authorities where there was more ample accommodation. If this 
were so then the admissions would be a better guide to the prevalence of the 
disorder than the actual number in the institutions. The values of the correlations 
for cancer death-rates with the insane are as follows: 


Insane enumerated 31/12/03 p = ‘8711 + 0420. 
Insane admissions 1904 p =°9173 + 0278. 
Diabetes and Insane admissions p = °8074 + 0608. 


These values seem to give additional support to the theory of a nervous factor 
in the aetiology of cancer, and it would be interesting to find an independent 
measure of the stress of life in the different districts, that could be used as an 
index to the intensity of the struggle for existence. 


In a general way one would expect that in the larger cities the competition 
would be greater than in the smaller ones, although this would probably not be 
true of cities with only small differences in population, and other factors besides 
mere size would have to be taken into consideration. No small cities occur in 
the tables here used, and the correlations between population and cancer or 
diabetes death-rates are scarcely significant : 


Cancer death-rate and population p= "1049 + 1055. 
Diabetes death-rate and population p= *3007 + 0970. 


I thought that a rough measure of the social and economic conditions pre- 
vailing in a district might be obtained by reducing the number of periodicals 











292 A Statistical Study in Cancer Death-Rates 


and newspapers printed in the various States to the rate per head of population, 
and correlating with cancer death-rate ; the value thus found is: 


p = "6182 + “1076. 


I record these observations for any interest that they may have, but do not think 
the‘ the deductions should be pushed too far. 


Another aspect of the question that requires investigation is the death-rate 
among the various occupations. No reliable insanity figures can be obtained that 
correspond to the grouping for occupations as given in Vital Statistics, Vol. L., 
U.S.A. Census, 1900. In fact the relation of insanity to occupation has not been 
worked out for the United States. As the Editor of the Report above referred to 
says “On the whole, the question of the relation of insanity to occupation is one 
that still awaits systematic treatment. The available means of enquiry do not 
suffice even for a perfect classification of occupations, much less for obtaining data 
which permit exact comparisons with the total number of the insane drawn from 
any given occupation with the total number of the same age, &c., engaged in that 
particular occupation.” 


Considerable difficulty arises in correlating cancer and diabetes in respect 
to occupations owing to the large value of the Coefficient of Variability of the 
correction factor for age distributions, in the various occupations. An inspection 
of these factors will show that an appreciable positive correlation may be expected 
from this cause, on the other hand the value of p’ will contain a spurious corre- 
lation of unknown size. Taking into consideration these facts and also that the 
negative sign of the correlation as found for the eight groups is unchanged even 
when uncorrected deaths are used, it will probably be safe to conclude that 
the correlation between cancer and diabetes death-rates with occupation groups 
is negative. 


Occupations. Cancer and Diabetes. 


8 Inclusive Groups p=— 0645 + °2375, p’ =—°6719 +°1308. 
28 Sub-groups p= °‘2866+°1170, p’=—°4832 +:0977. 


At first sight this may seem unexpected, but I do not think it is really surprising. 
The rates of both diseases vary considerably in the different occupations. It must 
be remembered that certain occupations consist of highly selected individuals, and 
thus the unstable in certain respects may have been weeded out. Whereas a 
similar condition imposed on a city or district may lead to an increase in both 
rates, conditions of work &c. may determine which form the manifestation of 
instability shall take. For instance occupations associated with a sedentary life 
probably predispose to diabetes. 


The use of an analogy will better explain my meaning. The death-rates from 
Typhoid Fever and Bullets during time of war would be much increased, and both 
in relation to time and place would show positive correlation with war. But if 
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these rates were to be compared in relation to deaths in occupations—soldiers at 
the front with men in garrison—no positive correlation would be observed. Thus 
under the heading of occupations we may be measuring rather the determining 
causes of the type of breakdown, than the fundamental cause of the disease. 


I may perhaps be allowed briefly to refer to the theory which has been adopted 
as a working hypothesis in the present investigation. As far as this investigation 
is concerned there does not seem to be any evidence as to the infectious nature of 
cancer. Nor has any evidence been adduced that would lead to this belief with 
the possible exception of cancer-houses, while on the other hand, all the improve- 
ment brought about by public health reform, in the way of reducing over-crowding, 
abolition of back-to-back houses, &c. has not been followed by any reduction of the 
disease, on the contrary an actual increase has been taking place. That nervous 
centres do exercise an extraordinary influence over tissue cells is a matter of 
common knowledge and that some nervous disturbance may affect the cell 
metabolism in diabetes, and the cell growth in cancer does not seem to me 
impossible, even if a novel view. This suggestion does not necessarily exclude the 
presence of an organism as the determining cause, and several analogies will occur 
to the reader. For instance in a disease like scurvy, which is almost certainly of 
infectious origin, the organism does not usually obtain a footing until the soil has 
been prepared by errors of diet. 


It must be understood that the strain occasioned by modern competition is 
only regarded as an exciting factor, in so far as it leads to a breakdown of nervous 
origin. This breakdown of course may occur without any undue strain, but will in 
the aggregate do so more frequently when this factor is present. The heredity 
factor, i.e. the constitutional weakness, is of course of primary importance. 


Problems connected with the occurrence of cancer in animals are at present 
difficult to deal with statistically. That malignant tumours occur in vertebrates 
is well known, as the following extract from the Third Report of the Cancer 
Commission shows :—“ Cancer is ubiquitous in man and vertebrate animals. It 
has been shown that cancerous tissue of a species of animal retains the characters 
of that species, whereas in infective tumours (e.g. tubercle) occurring naturally 
in separate species, the biological characters of the newly formed tissue are 
determined by the common infective agent and not by the tissues of the affected 
animal.” 


Again, the question of cancer in African Natives requires more study, and the 
increasing rates for these races as observed in America are of interest. The 
phlegmatic nature of the Bantu in Africa is well known. The prevalence of 
Diabetes among Jews is another problem of interest that awaits explanation. 
This race, however, seems particularly liable to certain nervous complaints, as for 
instance various forms of insanity, and Amaurotic Family Idiocy. 


Since writing the above I have been looking through the small literature of 
insanity at my disposal and have been struck with the following sentences :—“ It 
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has been said that Diabetes, or Glycosuria at least is frequent among the insane. 
There is, however, some connection between Diabetes and Insanity, as we shall 
proceed to point out. Dr Maudsley has recorded his experience that it is not 
uncommon to find a history of Diabetes in the parents or near ancestors of insane 


patients; this is also our experience, especially among the affluent classes.” 
G. H. Savage. 


“Dr Snow informs me that independently of actual brain-deposits, insanity 
is not uncommon among cancer patients, and that the incurable wards of the 
Cancer Hospital are seldom without two or three cases.” (Dict. Med. Psych., 
Hack Tuke.) 


Several considerations have led me to offer this paper for publication in its 
present form. Firstly, I venture to hope that the subject will be considered 
of sufficient importance to induce other biometrical workers to assist in its 
elucidation ; secondly, the time at my disposal is limited while much remains 
to be done; and thirdly, much of the work is impossible to a resident living 
away from the big libraries, as many books of reference are unobtainable. Even 
such districts as have been dealt with could be better handled by workers 
conversant with the districts and their local conditions. Lastly, the publication 
of a preliminary paper may bring helpful criticism and so enable future work to 
be more profitably planned. 


It would much enhance the value of the Death Reports, if the Registrars 
could see their way to add special tables of the causes of death, in cases where 
the diagnosis has been confirmed by post-mortem examination. A list of 
associated factors found in each case might be added, much on the lines 
adopted by the London Asylums Board. Thus there would in time grow up 
a body of statistical material of the utmost value for determining the true age 
distributions of the various diseases, and for obtaining many other important 
pathological facts. 


I would take this opportunity of thanking Professor Pearson for much kind 
advice, and also Dr J. M’D. Troup and Mr G. Clark for their assistance during 
the compilation of this paper. 


Conclusions. 


1. That recorded differences in cancer and diabetes death-rates, as applying 
to different Districts and Cities of the U.S.A., as well as the increased rates 


observed in recent years, do indicate real differences in the prevalence of the 
disease. 


2. The correlations found to exist between cancer, diabetes, and insanity are 
not fortuitous and due merely to errors of observation or record. 


3. The statistics dealt with in this paper do not give any support to the 
suggestion that cancer is of infectious origin. 
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4. That whatever theory as to the causation of cancer is adopted, some 


explanation of the remarkable correlations between cancer, diabetes and insanity 
is required. 


5. The explanation suggested to account for the facts as disclosed by 
statistical analysis is, that conditions of modern life, acting on physiologically 
unsound material, may account for the correlations existing between these three 
diseases, as well as for their increasing rates. 


6. If this suggestion be correct, then the study and teaching of Eugenics 
has an additional purpose. Although no hope for the sick can be derived from 
this source, yet the future of the race will not be so hopeless, when children 
are taught the eugenic law, that marriage with the physiologically unfit is as 
anti-social, as marriage with their social inferiors is at present held to be. 
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TABLE ILI. 


Diabetes Death-Rates. 


Constants of Curves. 





Suggested Old Age 


Total Data Mortality 
Mean 66°9786 70°7810 
Mode 860008 (Type J) 75°5300 
S. D. 32984 22603 
VB, 1:3989 0°7403 
Be 51345 3°6254 
Mid-Ordinates 
Age Observations 
Old Age a 
| Component Total Curve 
ek a (ae ere 
25 13 | 0 8 | 
15 19 0 19 
12°5 36 0 29 
17°5 38 | 1 38 | 
22°5 36 2 45 
27°5 51 5 51 
82°5 57 11 57 
87°5 65 24 | 67 
42°5 102 49 86 | 
4rd 117 93 122 
52°5 204 163 182 
575 | 238 269 277 
62°5 443 403 403 
67°5 579 553 553 
125 627 662 662 
105 685 | 676 676 
82°5 585 528 528 
87°5 201 | 252 252 | 
92°5 39 29 29 | 
97°5 3 | 3 3 
| | 


L 6+1835 j 
al Curve y=2280 oe ssi 
Total Curve y (1 + angi) (1 01 


Suggested Old Age Component =691°7 (1+ > 


XL 


34° 


1 


. — 0-0262 
5) F 


x“ 26-5162 
sas) (1 
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TABLE IIL 


England and 





Wales. 


Cancer Deatu-Rate 
(per Million) 





1891—1900. 


Diasetes Deatu-Rate 
| (per Million) 





| 
| 

















Age | So See ' = 
Male and } Male and 
| Female Male Female | Male 
= aaa te es 
0 30 33 4 5 
a 1618 18k = | 12 | 12 
15 37 41 | 33 37 
25 139) 99) >. 49 | BT}, 
35 | 6465 300 84 |) e779 | 
45 1833) o-a- 1300) ; 136) « on 
55 | 3662, 2°87 3160; 2032 3005 201 317; 234 | 
65 5604) -- 5325) » 460 559) . 
rn S| 15, 5766 5824) 5461 380; = | aa | 
Male ee Re ee 
Total Rate 757 600 75 | 86 | 


TABLE III bis. 


Death-Rate (per Million). 


England and Wales. 











Year Cancer | Diabetes 
1871—75 446 36 COC 
1876—80 | 494 40 
1881—85 | 548 51 
1886—90 i 632 62 
1891—95 711 69 
1896—00 | 800 81 
1901—05 | 864 89 
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TABLE IV. 


Meteorological Conditions. 


Average of about 30 years. 





City 


Baltimore 
Boston 
Buffalo 
Chicago 
Cincinnati 
Cleveland 
Columbus 
Denver 
Detroit 
Indianapolis 
Kansas City 
Louisville 
Memphis 
Milwaukee 
New Haven 
New Orleans 
New York 
Omaha 
Philadelphia 
Pittsburg 
Rochester 
St Louis 

St Paul 


San Francisco ... | 


Toledo 
Washington 


Atlantic City a 


Mean 


Hours 
Sunshine 


219 
208 
191 
218 
216 
173 
206 
251 
202 
207 
212 


Mean 
Temperature 


Maximum 
Temperature | Temperature 





| 





Absolute | 
Humidity | 
in grs. 


3°71 
3°15 
3°03 
3°17 
3°46 
3°21 
3°37 
2°03 
3°11 
3°38 
3°63 
3°67 
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TABLE V. 
| Cancer. 
a —_ -_ a - — 
| 
F Chiieiatiiinds Crude | Corrected | Population 
; | City Peston | Death-Rate | Death-Rate | per 1000's 
| 1900—04 1900—04 | 1903 
* | ———— - | ae | See: 
Allegheny Sad 1°2349 435 | 537 138 
Baltimore te, 10543 | 752 793 531 
Boston ED, | 1°0456 | 896 937 581 
Buffalo ‘sa 1°1940 687 820 367 | 
Chicago ui 1°3224 635 | 840 1873 
k Cincinnati te 1°0384 781 811 332 
| Cleveland ne 1-2073 582 | 703 | 414 
4 Columbus 1°0783 651 | 702 135 
: Denver 1°0876 655 712 | 147 
Detroit 11383 709 807 310 
| Fall River 1:2389 608 | 753 105 
Indianapolis 10313 578 | 596 200 
Jersey City 12151 516 627 222 || 
Kansas City 11791 553 652 173 | 
Louisville 1-0908 594 648 216 | 
Memphis 1°3327 373 497 114 
Milwaukee 1°2289 678 833 303 
Minneapolis ies 1°2289 587 | 721 238 
New Haven So “9882 783 774 115 
New Orleans 1°1035 736 812 301 
3rooklyn 1°1557 627 725 1280 
Manhattan 1°2701 743 944 | 2001 
Queens ane 1°1921 474 565 180 
Richmond as 9947 821 816 71 
Newark 11215 686 769 267 
Omaha 1°2701 519 659 113 
Paterson 1°1401 631 719 109 
Philadelphia 1°0341 699 723 | 1368 
| Pittsburg = 13211 519 686 345 
Providence ae ‘9793 850 832 189 
| Rochester | 1°0027 806 808 174 
St Joseph 1°4585 306 446 110 
St Louis 11610 622 722 612 
St Paul wag 1°3051 555 724 183 
| San Francisco 7 . 9546 1240 1184 | 356 
| Scranton ms 1°3351 420 561 110 
Syracuse bv ‘9668 760 | 735 114 
Toledo Ree 11383 | 574 653 146 
Washington ae 9439 745 703 | 293 
Worcester vids 1°0385 | 728 756 124 


| 
| 
Mean ses — — | 735°05 374-00 | 
| 128-01 | 449°24 | 
| 
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TABLE VI. 
| | DIABETES | 
Diabetes es | sys 
: P Suicides Nephritis 
saad — " | Ornde Corrected | 1900—04 | 1900—04 
| Death-Rate | Death-Rate 
| 1900-04  1900—04 | 
| Allegheny 11381 52 59 | 2145 574 
Baltimore 10173 99 101 113 1446 
| Boston 10013 | 151 151 136 858 
Buffalo 11194 146 163 | 111 | 1012 
Chicago 1°1773 85 100 228 1145 
| Cincinnati 10081 90 91 217 1256 
| Cleveland 1°1261 59 66 158 840 
Columbus 1°0260 83 85 139 702 
| Denver 10331 121 125 239 1098 
| Detroit 1°0807 113 122 | 112 751 
| Fall River 1°1450 114 131 61 1059 
| Indianapolis “9907 96 95 | 110 668 
Jersey City 1°1346 83 94 174 1384 
Kansas City ide 1°0791 65 70 | 161 1080 
Louisville w. | 10388 7 | 7 | 197 1043 
Memphis . | 11867 Ss | oe 2 oe 1370 
Milwaukee te 11556 94 | 109 215 623 
| Minneapolis .| 11322 79 | so | 18 | 721 
New Haven ‘9817 151 | 148 158 | 966 
New Orleans 1°0699 61 65 137 2065 
Brooklyn ee 1°0885 a 128 19 1740 
| Manhattan et 1°1504 152 175 225 2218 
| Queens 11261 94 106 216 1993 
Richmond "9855 144 142 152 1663 
| Newark 10623 112 119 238 1533 
| Omaha ee | 1°1468 64 73 | 142 602 
| Paterson “well 10791 93 100 90 1005 
| Philadelphia 1-0000 98 98 | 135 1492 
| Pittsburg 11811 47 55 155 767 
| Providence ‘9628 162 156 110 1324 
| Rochester des 9817 135 132 137 885 
| St Joseph es 12542 46 58 105 486 
| St Louis was 1°0885 68 74 258 1355 
St Paul we | 171811 74 87 125 724 
| San Francisco 9317 176 164 550 1070 
Scranton 1°2136 93 113 76 972 
Syracuse ae 9566 116 111 105 890 
Toledo van 1°0823 106 115 139 645 
Washington oe 9282 107 99 149 1103 
Worcester 1:0067 131 132 101 713 
| Mean ; | cone 107-02 | 154°44 1117-15 | 
o _ _ 32°61 80°82 436°58 
| 











emcee V 
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TABLE VIL. 
States Registration. 









































CANCER | D1aBETES 
| Death-Rate, 1906 | Death-Rate, 1906 | Suicides | 
: | Correction | ~ “*) "> | Correction | °**?- Hate, IS | 
; | Factor ; TexG7 Factor 4 
| | Crude | Corrected Crude Corrected 
/ Sr co es — Se 
| Maine ina “7081 862 610 "7622 168 | 128 104 
| New Hampshire ... “7135 892 636 7653 169 | 129 86 
Vermont oa “7049 853 601 ‘7622 | 183 139 140 | 
: | Massachusetts ... “9087 903 820 ‘9135 | 161 | 147 96 | 
} | Rhode Island i 9415 | 783 737 ‘9387 me | ies | ie 
| New York ga 9510 | 762 | 725 "9422 160 151 144 | 
Connecticut as ‘8731 806 703 8897 188 Gg |] a 
| New Jersey ... | 1°0079 661 666 9894 128}. SZ: | 
Michigan Pa “9334 676 | 631 9434 te. |. ee o-) 
| Indiana ree ‘9780 537 525 ‘9740 110 107. | 116 | 
Maryland av 1°0384 601 624 1°0176 96 98 89 
Pennsylvania .. | 10485 607 636 1°0218 108 |; 110 | 113 
South Dakota ... | 1°2308 | 354 | 436 | 11645 | 88 | 102 | 69 
| Colorado ‘S | 1°1828 513 607 1°1029 84 | 93 | 190 
California “+ | *8310 | 920 764 "8532 161 137 | 289 
| 
| ae =eme, ed ony ‘ 
Mean whe) - 648°07 _ 127°80| 131°93 
-. 5 - 91°94 “ 21-30 52°26 | 
| Ce ae 
TABLE VIII. 
States Registration. 
Insane (per million) 
Newspapers 
| - to 
Enumerated | Admitted Persons 
31/12/03 1904 
| 
Maine i 125 47 “920 
New Hampshire ... 117 83 522 
Vermont un 255 77 561 
Massachusetts ... 289 133 2°270 
Rhode Island a 235 92 500 
| New York ee 339 86 4-510 
| Connecticut AE 293 89 505 
| New Jersey ins 238 65 1°612 
| Michigan eal 216 58 “988 
Indiana at 166 53 841 
| Maryland i 202 66 ‘573 
| Pennsylvania aol 173 50 ° 1°837 
| South Dakota wef 141 32 586 
| Colorado a 129 86 | “984 
| California vee | 361 98 1-025 
Sts ————|-— ae a aS eee 
Mean x 218°60 74°13 1:22 
| o¢@ das 75°50 24°17 1-02 
Biometrika vit 39 
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Registration Districts. 


CANCER DIABETES 
ae | ee | 2 : | Suicides | 

C tie} Crude | Corrected C on Crude | Corrected | 

Fact 10n | Death-  Death- Fact 10 | Death- Death- | 

= Kate Rate weg Rate | Rate | 
is a. | ne fe PRES Re She ee 

| 
United States of America... | 9276 | 483 448 *8380 96 80 | 68 
Ireland nis Be vo | 8149 | 764 623 7239 104 75 61 
Germany . 7 *8592 | 782 672 "7653 122 93 193 
England and Wales” | 7400 | 720 533 “6818 121 82 104 
Canada | 1°3591 | 403 548 1-0653 63 67 65 
Scandinavia... | 1°4474 311 450 11161 33 37 | 107 
Scotland cae cant ‘6901 818 565 “6649 123 82 116 
Italy | 1°6935 228 386 | 1°2438 26 32 | 51 
Russia and Poland . v. | 2°0626 | 257 530 | 1°4296 33 47 | 358 
France ae Ae ned See 928 596 | +6000 129 77 220 
es. Ms Wa DK AMES omerS : : fink, Se 

Mean * 535°10 67°20 _ 
o 82°67 19-96 
TABLE X. 
Occupations. United States of America, 1900. Registered States. 


Professional ... bes 
Officials, Clerks, ete. 
Mercantile Trading, ete. 
Public Entertainers 
Service, Military, Police 
Manufacturing 
Agriculture 


| Le abourers not Agr icultural et 


Mean 


TABLE IX. 





A Statistical Study in Cancer Death- Rates 


Death-Rate according to Birthplace of Mother in 


United States of America. 











CANCER DIABETES 
| | 
Crude | Corrected oe Crude | Corrected | 
—— Death-  Death- ——— Death- | Death- | 
. Rate Rate —- Rate | Rate | 
— — — ——__—__— — — 
‘6764 517 350 | ‘6771 222 150 
8167 =| 377 308 ‘7762 181 140 
"7472 520 388 7257 194 139 
“7642 353 270 7251 159 115 
“7075 429 303 “6986 87 61 
‘7653 | 533 408 ‘7401 112 83 | 
5484 692 379 “5960 133 79 | 
*8054 | #665 536 “7678 79 61 
350°22 — 104°44 


99°54 





a 
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TABLE XI. 








CANCER DIAsETEs 


| 








| 
Correction | Death- Corrected Correction Death- | Corrected | 

i Factor Rate Doth. Factor Rate Desth- 

B } Rate Rate 
PS 3 ee a T Urer a atl ae Mk Ae .: 
| Bankers oe 3617 414 150 4350 207 90 | 
| Law oats “5234 525 275 “5669 315 179 

Physicians wee 5355 675 361 ‘D741 169 97 
Clergy a 4282 | 1022 438 4903 | 341 167 
| Officials a ‘8168 377 308 7762 181 | 140 
| Hotel Keepers Sta "4483 | #601 269 “5059 250 | 126 
| Book Keepers ae 1°3172 | 280 369 1°0238 158 162 
Collectors see “5344 581 310 ‘5780 216 125 
Merchants ave 5539 813 450 “5898 310 183 
Butchers ae “9094 445 405 *8159 78 64 

i Bakers G3 9422 510 481 "8350 128 107 

i Police eae *4814 603 290 *5328 93 50 

{ Printers «| -eaeee 221 280 “9965 37 37 | 

i Tailors ma "8402 584 491 *7804 143 112 | 

{ | Barbers 1°2119 275 333 9556 50 48 | 

Plumbers ... | 14966 82 123 1-077 62 67 | 
| Bootmakers Ze "6656 341 227 "6788 62 42 | 
| Carpenters ee 4841 777 376 D375 83 45 
| Farmers ; 4525 77 397 5206 165 86 
| Servants a 1°2513 443 554 ‘9817 74°} 73 
| Sailors — 5942 1131 672 6255 |) 168 105 

Blacksmiths a “5828 1073 625 “6143 123 76 

Masons Me 5210 980 511 5665 181 103 

Iron-workers oo 1°0422 315 328 "8848 29 26 

Draymen a 1-0106 226 228 "8687 59 51 

Labourers he "7792 691 538 ‘7511 79 59 

Factory Operators ... 13636 345 470 1°0565 40 42 

Pedlars we ‘7416 627 465 ‘7257 60 44 


TABLE XII. 
Correlation Values. 


Corrected Cancer Death-Rate and Meteorological Factor. 
In 27 Cities, 1906. 


Sunshine oon hte ..) «P= — 01424 1323 

Mean Temperature we P= — 11478 +°1294 

: Mean Maximum Temperature = —*2236+°1257 
Mean Minimum Temperature r= —°0146+°1323 

Rainfall nae = wee) P= — 2517 + 71289 

Days of Rain ... “es we) «6 P= — 1621 +°1288 

Absolute Humidity ... ww.) «P= — 201441269 





39—2 
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TABLE XII.—(continued). 
In 17 Cities, 1906. 
Sunshine ; a .. r= '1268+°1659 
Rainfall < ie .. r= '3163+°1518 
Mean Temperature r= — 0032 + °1685 
Mean Minimum Seebeciane r= ‘1254+ °1659 
Mean Maximum Temperature r= —*1509+-°1600 
Relative Humidity 8 a.m. r= °*3101+°1519 
Relative Humidity 8 p.m. r= *1603+°1600 
Range of Humidity y= *0724+°1685 
Correlations, 15 States, 1906. 
Cancer and Diabetes, 1906 Ta 0303, p’ = °8258 + 0554 
Cancer and Suicides =°4412+°'1402, p'=*3801+°1490 
Cancer and Insane icniestal, ‘31/12/03 a 8711+ °0420, p’ =*8222+ :0564 
Cancer and Insane Admissions, 1904 p=°9173 +0278, p’='9179+ ‘0274 
Cancer and Newspapers to persons, 1900 p=*6182+°1076, p’=*5686+°1178 


Diabetes and Insane Admissions 


p= "8074+ 0608, 
Suicides and Insane Enumerated 


p= "6062 +1101 
1900—1904. 


p’ = "7906 + 0653 


Correlations, Cities, 





Cancer with Diabetes, 1900—04 p=°6896+°0559, p’= °7325+ 0494 
Cancer with Nephritis, 1900—-04 p='3792+°0913, p’= °1571+°1040 
Cancer with Suicides, 1900—04 = 3962+ 0899, p’= 6895+ °0559 
Cancer with Population, 1900 we =p='1049+'1055, p’= *4792+ 0822 
Diabetes with Population, 1900 p=°3007+°0970, p’= 4084+ 0889 
Cancer with Ill-defined Causes p="1020+°1055, p’= — °3453+ °0939 
Diabetes with Ill-detined Causes p='0773+°1060, p’= —°0193+°1066 
Cancer with Percentage Error of Registration p=*0526+°1064 


Cancer Death-Rate according to Birthplace of Mother, 
United States of America, 1900, with 


Cancer 


” 


Diabetes 
Suicides 


p= "8609 + 0552, 


Death-Rate according 


States of America, 
Diabetes (8 Large Groups) p= — 0645 + ‘2357, 


(28 Sub-groups) 


p= 


p’ = "5442+ "1501 
p’ = °6731 +1166 


to Occupation, United 
with 


1900, 


Occupation (Social Rank) with 

= — 5176+ °1746 
"9374 + °0289 

— *4574+ °1007 
*6716 + ‘0700 


Cancer (8 groups) r 
groups) 7 
Cancer (28 groups) ry 
Diabetes (28 groups) 7 


Diabetes (8 


p’ = — 6719+ °1308 
‘28664°1170, p’=— 4832+ 0977 


Death-Rate of Cancer by 5-yearly periods, 1871—75 to 1901—05, 
England and Wales, with 
Diabetes Death-Rate p= *8060 + 0893 














A QUANTITATIVE STUDY OF THE MORPHOLOGY OF 


THE FRUIT OF THE BLOODROOT, SANGUINARIA 
CANADENSIS. 


By J. ARTHUR HARRIS, Pu.D., Carnegie Institution of Washington. 


I. INTRODUCTION. 


HERETOFORE investigations in variation and correlation carried out by means 
of modern quantitative methods have been largely for the purpose of throwing 
light upon the problems of evolution. But for several years I have been convinced 
that the methods of higher statistics should find an extensive application in the 
fields of morphology and physiology. 


The present memoir is one of a series in course of preparation in which an 
attempt has been made to apply the statistical methods to the problems of the 
morphology and physiology of the inflorescence and fruit. The results set forth 
will not show their full significance until the constants obtained for other forms are 
laid beside them for comparison. These data will be ready shortly, but it seems 
best not to anticipate the results from material not completely worked up. 


II. MATERIAL AND METHODs. 
(a) The Morphology of the Fruit of Sanguinaria. 


Sanguinaria, a monotypic genus of the Papaveraceae, widely distributed over 
the eastern portion of North America, is so familiar to botanists that a detailed 
discussion of the superficial form of the fruit borne on its slender pedicel is hardly 
necessary. Gray* has given the essentials when he writes :— 


“Ovary oblong, one-celled, with two parietal placentae ; style short, columnar ; 
stigma broad, suleate—two-lobed, the lobes alternate with the placentae. Ovules 
very numerous, horizontal in several rows on the two placentae, anatropous. 





* Gray, A., Gen. Pl. U.S. Vol. 1. p. 115, 1848. 


————————————————————————— 


i 
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“Capsule oblong, somewhat compressed, herbaceo-coriaceous, many-seeded, 
pointed by the short, persistent style, two-valved; the valves separating from 
the replum formed by the intervalvular filiform placentae.” 


oa 





Fic. 1. Diagram of fruit of Sanguinaria, opened in the upper part on one side, showing two 


placentae with matured seeds and abortive ovules. 


I am unaware of any investigation of the embryological development of the 
ovary of Sanguinaria. Until such data are available we shall not be able to say 
more concerning the morphology of the fruit than Gray did sixty years ago. The 
essential points to be remembered in the present work are the following. The 
fruit is borne on a slender pedicel sent up from the root-stalk. It is one-celled 
with two parietal placentae on which the seeds are arranged in two or three 
irregular rows. Judging by analogy with other ovaries whose embryology is 
known, it would seem probable that each placenta is composed of the margins 
of two distinct carpels. 


(b) Collection of Material and Characters considered. 


In the spring of 1906 I collected at Meramec Highlands, near St Louis, Mo., 
a series of 1000 fruiting stalks of Sanguinaria. At the time, the stalks seemed to 
have their full growth, so far as could be judged by general criteria, but the seeds 
were not fully mature. 

In the spring of 1907 another lot of fruits was secured from the same habitat. 
Only 400 were gathered, and in this case the peduncle was not taken. This 
second collection was made considerably later in the season than the first, and 


the fruits were sensibly larger, as will appear when the means of fruit lengths 
are considered. 
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From these collections the following measurements and countings were made :— 


1. Length of Peduncle. 

2. Length of Fruit. 

3. Seeds developing on first placenta. 
4. Ovules formed on first placenta. 

5. Seeds developing on second placenta. 
6. Ovules formed on second placenta. 

7. Aborted ovules on first placenta. 

8. Aborted ovules on second placenta. 
9. Total seeds developing per fruit. 

10. ‘Total ovules formed per fruit. 


11. Total aborted ovules per fruit. 
Some words of explanation concerning these characters are necessary. 


1. Length of Peduncle. A study of the length of the peduncle was not a 
primary purpose of this investigation, but it was taken up in a rough and ready 
manner to gain some general appreciation of the influence which it might have 
upon the magnitude of the characters of the fruit. It is idle to expect to attain 
great precision in the investigation of a character of this kind. 


Sanguinaria is a perennial plant and we do not know the age of the individual 
root-stalks which produced these peduncles and fruits. The plants were very 
probably of different ages; all that we can say with certainty is that they were 
mature enough to produce fruiting stalks. It is quite possible that the age of 


the individual has a material influence upon the character of the flowering stalk 
which it produces. 


There is unquestionably some error in the determination of the length of the 
stalks due to the method of collecting. They were simply carefully pulled from 
the rhizome. Considering the nature of the material with which we are dealing, 
I think the determinations of stalk length were quite accurate enough for 
practical purposes. . 


2. Length of Fruit. In both collections the length of the fruit was taken to 
the nearest millimetre. 


As a character for investigation the length of the fruit presents some serious 
difficulties. The fruit is not infrequently somewhat curved or otherwise irregular 
in form, and there is relatively a large possible error in measurement. The fruits 
collected in 1906 had not quite attained their full size, but I was afraid to leave 
the collection until later because of the difficulty of counting the seeds which 


very easily become detached. In 1907 the material had more nearly attained 
its full development, 
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To reduce the distributions to somewhat more workable limits the fruits have 
been combined in groups of 3 mm. range. These have proved very satisfactory 
working units. Constants calculated from them are essentially the same as those 
calculated from the ungrouped distributions, as I have assured myself by actually 
carrying out the arithmetic. 


3—11. Meristic Characters of the Fruit. The counting of the number of 
ovules formed, the number of seeds developing, and the number of abortive ovules 
is not at all difficult in this species. The fruits were of slightly different ages in 
the two collections, but in both cases they were old enough for the determination 
of the number of seeds developing to be entirely trustworthy. 


In recording these characters the numbers on each placenta were noted 
separately. In the following pages the placentae are referred to as “first” and 
“second.” The two are not differentiated at all, the “first” placenta being 
merely the first one counted when the fruit was opened. The constants for 
the distribution of ovules, seeds and aborted ovules should be the same for the 
two placentae within the limits of the probable error of random sampling. In 
investigating many relationships the tables may be made symmetrical, that is to 
say each placenta may be used once as the “first” and once as the “second” 
number of a pair. The advantage of using these symmetrical tables has been 
several times pointed out in biometric literature*. 


I have found it rather an advantage to work with ovules, seeds and aborted 
ovules. Naturally, the seeds developing and the aborted ovules per fruit equal 
the number of ovules formed, and the tables may easily be verified if all three 
characters be used. Characters 9, 10 and 11 are obtained by adding 3 and 5, 
4 and 6, 7 and 8. 


(c) Method of Reduction of Data. 


Sheppard’s correction was used throughout in the calculation of the standard 
deviations. 


In calculating the probable errors where more than one organ from each 
individual is measured, there is always some question as to what value of WV 
shall be used. I have taken N as the actual number of flowering stalks in all 
cases in which one of the characters involved in the constant was measured only 
a single time for each inflorescence. For instance in correlating between length 
of fruit and number of seeds per placenta, or between the fertility characters of 
the two placentae of the same fruit, N was used as 1000 and 400 in the two 


* For instance, by Pearson and his assistants in their memoir on homotyposis in the vegetable 
kingdom (Phil. Trans. Roy. Soc. Lond. A, Vol. cxcvm. pp. 285—379); by Pearl on Paramecium 
(Biometrika, Vol. v. pp. 249—251, 1907) ; by Pearson and Barrington in their memoir on the inheritance 
in vision (Francis Galton Lab. for Nat. Eugenics, 5, 1909) ; by Harris (Biometrika, Vol. vi. pp. 214— 
218, 1909). 
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collections. But when the constants are calculated for the repeated organs it 
seems logical to calculate the probable errors on the basis of the actual number 


of such organs, and this was done, for the constants calculated for the characters 
of the individual placentae. 


III. PRopteMs CONSIDERED. 
1. Type and Variability of the Characters Investigated in the 
Two Collections. 


The fundamental physical constants are given in Tables 1 and 2. It has 
seemed best to table each collection separately and to put them side by side 
for comparison in a third table. The distributions upon which these constants 
are based will be found as the totals of the correlation tables. The differences of 
the constants to be compared are given with their probable errors in Table 3. 


TABLE 1. 


Constants for 1906 Collection of Sanguinaria. 








| Average Standard | Coefficient 

| Character and | Deviation and | of 

| Probable Error | Probable Error | Variation 

ee 

Length of Stalk in 5 cm. Units* ... ..- | 170904058 | 2°736+-041 16°01 
Length of Fruit in 3 mm. Unitst ... oes ... | 39°703+°134 | 6307+:095 | 15-88 
Ovules on First Placenta ... poe ¥ ... | 14980+°097 | 4569+ °069 | 30°50 
Ovules on Second Placenta ... | 14°9924+°099 | 4°658+°070 | 31°21 
Ovules on First and on Second Pl: wcenta . ... | 14:952+°070 | 4614+:049 | 30°86 | 
Total Ovules per Fruit 5 ved ioe w. | 29°9904+°193 | 9°054+-136 | 30°28 

| Seeds on First Placenta... “ia - ... | 11°584+°083 | 3°8724+°058 | 33°42 
Seeds on Second Placenta ... = ... | 11:471+4°085 | 3°977+°060 34°67 
Seeds on First and on Second Placenta sits ... | 11°527+°059 | 3°925+ 042 34°05 
Total Seeds per Fruit : ne ... | 23°055+°161 | 7°5314+°114 32°66 
Aborted Ovules on First Placenta... a Bua 3°396 + ‘060 | 2°827+°043 83°25 
Aborted Ovules on Second Placenta 2 3°453+°060 2°802+°042 81°14 
Aborted Ovules on First and on Second Placenta 3°424+ 042 | 2°812+°030 82°12 
Total Aborted Ovules per Fruit ... 4 ... | 6849+°110 | 5°162+-078 75°36 


* Constants tabled in units of 1 cm. 
+ Constants tabled in units of 1 mm. 


Another constant which is of much service in studies of fertility is not given 
in either of these tables. This is the ratio of the total seeds developing to the 
total ovules formed. 


Several years ago Pearsont defined fecundity as the ratio of the number of 
offspring actually produced to those which might have come into existence 
under the circumstances. His definition was formulated for use in a study of 


t Phil. Trans, Roy. Soc, Lond. A, Vol, cxcu, p. 261, 1899. 
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inheritance in thoroughbred horses. It seems to me a good one and I shall 
use it throughout a series of studies on fertility in plants, modifying the terms 
involved as may be necessary but keeping always to the general idea of the 
definition. Mayer* used the same ratio in a study of symmetry and fertility in 
a Medusa. He used the term “coefficient of fertility.” Coefficient of fecundity 
is perhaps a more suitable term, agreeing with Pearson’s terminology. The 
coefficient of fecundity is a very useful constant. Its value lies between 0 and 1, 
and its form is therefore similar to that of some other statistical constants. 


TABLE 2. 
Constants for 1907 Collection of Sanguinaria. 





) ” 

Average | Standard Coefficient | 

Character and | Deviation and of 
Probable Error Probable Error | Variation 


aene i eet RES 


Length of Fruit in 3mm. Unitst ... eA ... | 51°865+°268 | 7:°948+-190 15°32 
Ovules on First Placenta ... am sie ... | 12°812+°137 | 4:052+-097 31°63 
Ovules on Second Placenta ... | 12°792+°136 | 4+ 24 + 096 31°46 
Ovules on First and on Second Placenta ... | 12°302+°096 | 4°038+°068 | 31°54 
| Total Ovules per Fruit iss ‘e ase ... | 25°605+°265 | 7°850+°187 | 30°66 
Seeds on First Placenta... ae “6% ee 9°717+°134 | 3:980+°095 | 40°96 
Seeds on Second Placenta ... ous si 97954°135 | 3:991+°095 | 40°74 
Seeds on First and on Second Plac centa eae - 9°756+°095 | 3°986+-067 40°85 
Total Seeds per Fruit : <6 eg . 19°512+°258 | 7660+°183 | 39°26 
Aborted Ovules on First Pla ac centa mae Bes 3095+ 086 | 2°559+-061 82°68 
| Aborted Ovules on Second Placenta _... 2°997 +088 | 2°610+°062 | 87°09 
Aborted Ovules on First and on Second Pk wenta 3°046+ 062 | 2°585+-044 84°86 | 
Total Aborted Ovules per Fruit ... es ey 6°092+°161 | 4°785+°114 78°55 


+ Constants tabulated in units of 1 mm. 
In the present case the coefficient of fecundity is the ratio of the total seeds 
developing in the population to the total ovules produced. 


For the present series of data we have :— 


. ; _ 23055 ve a 

For 1906 series, ¢.F. = 39904 ‘7710 + 00165, 

" : 7805 = 

For 1907 series, c.F.= re = ‘7621 +0042, 
Difference, — ‘0089 + 0045. 


* Mayer, A. G., Sci. Bull. Brook. Mus. Vol. 1. 1901; see also C. B. Davenport, Biometrika, Vol. 1. 
pp. 255, 256, 1902. 

+ It is known that if y, be the frequency of any class of individuals in a population of m individuals, 
the probable error of the frequency is given (Biometrika, Vol. 11. p. 274, 1903) by the formula 


ky, =-07449 x/ y, x (a : ue) 
m 


In the present case y, is the number of seeds developing in the population m, or total number of 
ovules. From the absolute probable error of the number of seeds developing calculated from this 
formula, the probable error of the coefficient of fecundity was obtained by taking the ratio of the 
absolute probable error to m, the total number of ovules, 
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In the present paper the study of variation has been a means to an end. The 
standard deviations have been necessary in the calculation of the coefficients of 
correlation, and the consideration of the interrelationships between different parts 
of the organism as indicated by the coefficient of correlation has been the main 


purpose of the paper. 


TABLE 3. 


Comparison of Constants for 1906 and 1907 Collections of Sanguinaria. 


| } 

Average Standard | Coefficient 
Character | and Deviation and of 

| Probable Error Probable Error Variation 











| j 

Length of Fruit, 1906 = ae ... | 39°7034°135 6307 £°095 | 15°89+ -23 
na se 1907 ie .. | 518654268 | 7°948+°190 | 1533+ 37 | 
Difference | +12°162+°300 | +16414-212 |- 56+ 43 

| 

| Ovules per Placenta, 1906 a ... | 14°952+°070 4°613+°049 | 30°86+ °51 | 
| 2 ‘ 1907 Si ... | 12:8034+°696 | 4:038+-068 | 3164+ 83 | 
Difference | —2°149+°119 | — °575+°084 + ‘78+ ‘98 | 
| Total Ovules per Fruit, 1906... wae 29°904 + -193 9°055 +137 | 30°28+ 50 | 
@ 2 i 1907... ... | 25°605 +265 7850 +°187 | 30°66+ -80 | 
| Difference | —4-2994°328 | —1:2054+°232 + “38+ ‘94 | 
| Seeds per Placenta, 1906 ... a aes 11°527 + ‘059 3°925+°042 | 34°05+ ‘57 
- < 1907 ... ae Se 9°756 +°095 3°986 + 067 | 40°85+1-13 | 
Difference | —1°7714°112 | + -0614°079 |+6:80+1-26 | 

| 
| Total Seeds per Fruit, 1906 55 mt 23°055 +°161 7°531+°114 32°66+ °54 
is » 1907 2 ... | 19°513+-258 7-660 +:183 | 39°26+1-07 
| Difference | —3°5424°304 | + 1294-215 |4+6-60+41-20 | 
Aborted Ovules per Placenta, 1906 uA 3°424 + 042 2°812 +030 | 82°12 +2°69 
s % ss 1907 re 3°046 + 062 2°585+°044 | 84°87 +317 | 
Difference | — -378+°075 | — °227+°053 |+2°75+4:15 | 


Total Aborted Ovules per Fruit, 1906... 6°849 + °110 5161 +:078 | 75°36 + 2°43 
a : . » 1907 ...| 60934161 | 4:7854°114 | 7854427 
Difference | — °756+4°195 — 376+°141 |4+3°18+ 


| \ 


The constants for the two years considered individually require no separate 
discussion. The means are not of interest except in comparison with the same 
constants for other races of Sunguinaria. The standard deviations and coefficients 
of variation are of about the order generally found for plant characters. 

The comparison table shows that some of the differences are several times 
their probable errors, and hence clearly significant. For length of fruit this is 
quite obviously due to the somewhat greater maturity of the 1907 series. Here 
I do not care to discuss in detail the interpretation to be assigned to these 
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differences. I must point out, however, that the fact that our probable errors 
show that there are significant differences in the samples with which we are 
dealing does not justify us in concluding that there are differences in the popu- 
lations which they represent unless we are quite sure that our samples have 
been drawn in precisely the same way. Rocks, steep banks and gentle slopes, 
fallen logs and drifts of leaves are possibly all factors which make for heterogeneity 
in the environment of the individual organism in a plant formation which the 
ecologist would consider quite homogeneous. In collecting two samples a and b 
from such a habitat we might get constants differing sensibly with regard to their 
probable error merely by failing to have the several subhabitats contribute the 
same proportional number of individuals in each case. 


These points are emphasized because I do not wish to have these results 
interpreted as indicating that there is a real difference between the whole 
population of Sanguinaria at Meramec Highlands in the two years in question 
due to some peculiarity in the seasons themselves. There may be such a 
difference, but there is no strong evidence for such a conclusion in our data 
which were not collected in a way, nor for the purpose, to determine whether 
there is a real difference between the whole populations in different seasons. 


Turning aside from the contrasting of the constants for individual characters 
and taking a general view of the whole comparison table, I think the impression 
must be that the agreement between the two collections is very close. 

I think we may consider the two series of material upon which we propose to 
investigate certain morphological relationships as on the whole very similar in 
nature. Tests for the similarity of series of material used in morphological work 
have rarely been applied, but it would be desirable if some attention could be 
given to points like this. 


2. The Correlation between the Length of the Peduncle and the Length 
of the Frutt. 
It seems important to obtain some idea of the relationship between the degree 
of development of the fruit-stalk and the size attained by the fruit. The data 


appear in Table I, 1906, and I have calculated both the coefficient of correlation, r, 
and the correlation ratio, 7. 


This latter constant, recently introduced by Pearson*, furnishes an adequate 
description of the interdependence of two variables when the rate of change of 
the dependent variable cannot be expressed by the slope of a straight line. 


The two constants have the values 

335 +019 
n ='379 + 018+ 

n—7r= ‘044 


r 


ll 


* Pearson, K., Drapers’ Co. Research Memoirs, Biometric Series, No. 2, Dulau & Co., 1905. 
+ Here EL, was calculated by Pearson’s short formula, loc. cit. p. 20, 1905. 
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It appears, therefore, that there is a very substantial interdependence between 
the length of the peduncle and the length of the fruit. It will be borne in mind, 
however, that the fruit in this serigs had not quite reached its maximum size, and 
that owing to difficulties inherent in the species, constants involving peduncle 
length and fruit length will always be open to some question. 


Blakeman* has shown that, if & = y?— 7°, a good test for linearity of regression 
is given by using the formula 


ae: ae 1 


E, 067440°2%* Ji¢a—my-d—ny 


How large the value of &/H; shall be to be considered significant must be 
decided by each writer for himself, but when we find £/£;>2°5 we may begin 
to be suspicious of the linearity of our regressions. From our data we find 
Length of Fruit. 
28 29 30 3I 32 33 34 35 36 37 38 39 40 41 




















— ——+— 





Dracram I. Regression of fruit length on peduncle length. (Means depending on a few 
isolated observations not joined up.) 


* Blakeman, J., Biometrika, Vol. 1v. pp. 335—350, 1905. 
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& =03149, and &/H;=4°279. It seems rather questionable, therefore, whether 
our regression line can be regarded as strictly linear within the limits of the 
probable errors of random sampling. 


The equation of the regression straight line is 
y = 26517 +7722, 


I have plotted the means of arrays and find that they are scattered with some 
irregularity on either side of the theoretical line, but considering the nature of our 
material the fit is perhaps as good as could be expected. There is no suggestion 
in the graph that any other curve would give better results. 


The biological significance of this constant showing the interdependence 
between length of flowering stalk and length of fruit seems to me to be chiefly 
descriptive. We cannot say that the length of the stalk determines to some 
extent the length of the fruit, or vice versa. In short, we have not “explained ” 
anything, as we biologists are so wont to do. But we have measured a relation- 
ship and recorded it in a form that may be directly compared with other similar 
measurements when they shall have been made, and when the series so described 
are enough, comparison will permit of pushing problems further. 


3. The Relationship between the Length of the Peduncle and the 
Fertility of the Fruit. 


We have little definite knowledge of the relationship between the physical 
development of an organism and its fertility. In man Pearson* has brought 
forward evidence to show that tall women are more fertile than short ones. I am 
not aware of further published evidence for animals, though Professor Pearson 
informs me that data for other mammals are in existence and will shortly be 
published. 


In plant material it has been shown in an earlier paper+ that there is a 
correlation of about ‘5 or over for length of flowering stalk and number of flowers 
per umbel in Nothoscordium and Allium. In the case of Cercist and Celastrus§, 
however, there seems to be no correlation between the size of the inflorescence 
as measured by the number of flowers or fruits produced and the fertility of the 
ovaries. Before we can go very far into the problem of the relationship between 
the degree of development of the vegetative and reproductive characters we must 
have together a sufficient number of series of reduced data to enable us to make 
extensive comparisons. 


In the table of correlation constants I have given the relationship between 
the length of the peduncle and the fertility characters of “first” and “second” 


* Pearson, K., Proc. Roy. Soc. Vol. tix. pp, 301—305, 1896. 
+ Harris, J. Arthur, Ann. Rept. Mo. Bot. Gard. Vol. xx. 1909. 
t Harris, J. Arthur, Biometrika. In press. 

§ Harris, J. Arthur, Ann. Rept. Mo. Bot. Gard. Vol. xx. 1909. 
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locules separately. The constants were worked out largely to furnish an extra 


safeguard against arithmetical blunders, but the results seem worth publishing 
as this may indicate to the non-statistical reader something of the amount of 
divergence between constants which may arise purely as a result of chance. 


The correlation coefficients and their probable errors are given in Table 4. In 
two cases I have also compared the correlation ratio, 7, See Table 5. In these 
two cases Blakeman’s formula cited above gives :— 

For length and ovules, &/H; = 4125, 
For length and seeds, &/H#; = 4131. 


Here again there is some question concerning the linearity of our regression. The 
equations to the regression lines are :— 


For ovules per placenta, y = 5863 + °532z, 
For seeds per placenta, y= 2°987 + 5002. 


Number of Ovules and Number of Seeds. 





Length of Stalk. 

















Diacram II. Regression of number of ovules per placenta on peduncle length and of number 


of seeds per placenta on peduncle length. 


Solid dots and line=ovules ; circles and dotted 
line = seeds. 


(Means depending on a few isolated observations not joined up.) 
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In neither case can the straight line be considered very satisfactory, but at present 
I do not see the advantage of seeking for further regression equations. When 
more biological data are available it may be profitable to go further with the 
statistical analysis. 


TABLE 4. 


Correlation between Length of Peduncle and the 
Fruit in Sanguinaria. 


Fertility of the 


Relationship between Length of Stalk and 





Ovules on First Placenta 

Ovules on Second Placenta ' 

Ovules on First and on Second Placenta 

Total Ovules per Fruit . 

Seeds on First Placenta — 

Seeds on Second Placenta 

Seeds on First and on Second Placenta 
| Total Seeds per Fruit 


| Correlation and 
| Probable Error 
| 


‘313 +019 
321 +°019 
‘315 +019 
323 + 019 
‘341 +-019 
‘356 +019 
‘348+-019 | 
363 + ‘019 





Aborted Ovules on First Placenta 039 + ‘021 
Aborted Ovules on Second Placenta ... 029 + 021 
Aborted Ovules on First and on Second Placenta... 034 + 021 


Total Aborted Ovules per Fruit 037 + 021 





TABLE 5. 
| Relationship r : q~4 
Length and Ovules per Placenta ... 315+ 019 361+ ‘019 046 | 
Length and Seeds per Placenta 348 +019 388 +018 040 


of inter- 
between the length of the fruit-stalk and the number of ovules 
formed and the number of seeds developing as between the length of the fruit- 
stalk and the length of the fruit. In the case of both ovules and seeds the 
correlation between length of peduncle and the total number per fruit is slightly, 
though not significantly, higher than for the numbers per placenta. 


These constants show that there is essentially the same degree 
dependence 


A question in which the biologist is interested is this:—Is the relationship 
between the length of the peduncle and the number of seeds developing more 
intimate than that between the length of the peduncle and the number of ovules 
formed ? 


Suppose for a moment that the degree of development of the peduncle affords 
some rough measure of the “vigor” of the plant. I think no physiologist can 
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object to it as a working hypothesis, and I dare say that most would take it for 
granted. Now it is important to know whether the vigor of the plant shows 
itself more in the number of seeds developing—a character which is not definitely 
and unchangeably fixed until after a considerable period of physiological activity— 
than in the number of ovules formed—a character definitely and unchangeably 
established relatively early in the development of the organ we are studying. 


If vigor finds fuller expression in the number of seeds developing than in the 
number of ovules formed we should expect to find a higher coefficient of correlation 
between the length of stalk and the number of seeds than between the length of 
stalk and the number of ovules. The comparison is made in Table 6. 


TABLE 6. 


Comparison of Relationship between Stalk-Length and Number of 
Ovules and Stalk-Length and Number of Seeds. 


: Correlation and 
Relation between Stalk-Length and | Deohakle Secor 


: 





Ovules on First and on Second Placenta ... | 315+ 019 
Seeds on First and on Second Placenta ... | 348 +°019 
| Difference ... +°033 + 027 


| 

| | 

| Total Ovules per Fruit ve ae "3234019 | 

Total Seeds per Fruit rae oe on 3634019 | 
Difference ... aioe +°040 + 027 


It appears that while actually the correlation between length of stalk and 
number of seeds is slightly higher than that between length of stalk and number 


of ovules, the difference is so small as to be insignificant in comparison with its 
probable error. 


4. The Relationship between the Length of the Fruit and its Fertility. 


That there should be a fairly close interdependence between the size of a fruit 
like that of Sunguinaria and the number of ovules formed and the number of 
seeds developing is to be expected. The placental space in a fruit of this species 
is pretty well filled up with ovules, and if there are many ovules the placentae 
must be longer than if there are only a few. The demonstration by means of 
a “formidable mathematical apparatus” of this rather obvious fact is not in itself 
a matter of much weight, but it is of importance to measure the degree of 
resemblance instead of merely asserting that a relationship exists, and to express 
the result of the measurement in such terms that it may easily and directly be 
compared with like measures in other species. 


The constants for both lots of data are compared in Table 7. 
Biometrika vi 41 











318 Morphology of the Fruit of Sanguinaria Canadensis 
TABLE 7. 


Correlation between Length of Fruit and Number of Ovules Formed 
and Seeds demrsindeety per Placenta and per Fruit. 


| Correlation and Probable Error 
Relationship between Length of Fruit and ——_—_—_______—— 
| 


1906 1907 Difference 
* I aetini “ 
Ovules on First and on Second Placenta... .. | 517+°016 | 638+°020 | +°121+ 025 
Total Ovules per Fruit J . | 5274°015 | 6564-019 | +°131+°025 
Seeds on First and on Second Placenta | 552+°015 | 661+°019 | +°109+°024 
Total Seeds per Fruit = | ‘576+°014 | 688+°018 | +°112+°023 
| Aborted Ovules on First and on Second ‘Place enta 078+ °021 | 0238+-034 | —-106+-040 
Total Aborted Ovules per Fruit 085 +021 | 025+°034 | —:110+°040 


| 
| 
| 


The equations to the regression straight lines for length of pod and ovules per 
placenta and for length of pod and seeds developing per placenta in the two series 
are given in Table 8. Table 9 affords a comparison of 7 and 7 for these relation- 
ships. 


TABLE 8. 


Equations to Regression Straight Lines for Fertility Characters 
on Pod Length. 


Regression of Fertility Characters on Pod Length | Equation to Straight Line 


| Ovules per Placenta on Pod Length, 1906 ... y= 06343782 
Seeds per Placenta on Pod Length, 1906 ... y= — 21154 344a 
Ovules per Placenta on Pod Length, 1907 ... y= — 4000 + 324% 
Seeds per Placenta on Pod Length, 1907 ... Y= — 7438 + 3322 


TABLE 9. 
Comparison of + and » and Blakeman’s Test for the Linearity of Regression 
for Fruit Length and Fertility Characters. 


| 
: . Coefficient of | Correlation 
Relationship 


Correlation, r ratio, 7 | rag g/Ke 
Ovules and Length, 1906 ‘S17 +016 | 532+°015_ | 016 2‘971 
Seeds and Length, 1906 52+°015 | 552+°015 — -- 
Ovules and Length, 1907 ‘638 + 020 637 +°020 | 


Seeds and Length, 


907 661 +°019 677+ 018 021 | 2°160 
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Tn both of the cases in which £/E; was calculated it is found to have a value 
not greatly exceeding 2°5 in the first case and falling somewhat short in the second. 
Judging merely from the statistics in hand, the deviations of the means of arrays 
from a straight line may be due to nothing more than the probable errors of 
random sampling. In the case of length and seeds for 1906 and length and ovules 
for 1907, &/Z; was not calculated, since 7 is slightly less than r. This theoretically 
impossible result is explained as follows. In calculating » I followed the con- 
ventional practice of using the standard deviation for the population calculated 
without Sheppard’s modification. In calculating r, however, Sheppard’s correction 
was applied to both standard deviations. Had standard deviations without 
Sheppard’s correction been used, r would have been lowered somewhat by the 
increase in the value of the product o,o,, and would then be less than ». It 
seems rather pedantic under the circumstances to recalculate the values of r, 
for biologically the results could have little increased value. For one year it is 
the correlation for length and ovules and for the other year it is the correlation 
for length and seeds which give such nearly identical values of r and ». There is, 
therefore, no reason to think that the regression may be non-linear for ovules 
and not for seeds, or vice versa. 


For both ovules and seeds the constants for 1907 are about 0°110 + ‘025 higher 
than in 1906. 


Possibly the more mature condition of the fruits gathered in 1907 may have 


as one of its results the raising of the coefficient of correlation for length of fruit 
and number of ovules or seeds. 


These constants are purely descriptive. They express the degree of inter- 
dependence between length of fruit and number of ovules or seeds but do not 
explain anything concerning the cause of this interdependence. The correlation 
coefficients in themselves cannot even be considered adequate to establish the 
existence of a causal nexus inherent in the fruit primordium and finding expression 
in the relationship observed between length of fruit and number of ovules or 
seeds. There are two ways in which a correlation between fruit length and 


fertility might arise independently of any cause for such a relationship innate 
in the fruit. 


(a) Heterogeneity in the individuals from which the fruits were taken. 


(b) Correlation between both the length and the fertility of the fruit and 
some other organ of the plant, say length of peduncle. 


The first of these possible sources of correlation is not open to investigation 
on the basis of our material. The second may more easily be taken into 
account. 


Both length of fruit and number of ovules and seeds are correlated with length 
of peduncle. What influence will this have upon the correlation between the 
length of the fruit and its fertility? The best way of getting at this problem seems 
to be to determine the correlation between the length of the pod and its fertility 
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characters for constant length of stem. In doing this we strip off, so to speak, 
the influence of the length of the stalk upon the coefficient of correlation 
between length and fertility of fruit and have our relationship uninfluenced by 
this character. 

The familiar formula for the partial correlation coefficient is 


on ee tee 
he ee 


— 1x7 N1 ry? 





In the present series # is length of fruit, y the number of ovules or seeds, as the 


case may be, and z, whose influence on the gross correlation is to be eliminated, is 
the length of the peduncle. 


The partial correlation coefficients for the total ovules and total seeds for the 
1906 series calculated by the above formula appear in Table 10. 


TABLE 10. 


Correlation between Length of Fruit and Number of Ovules and Seeds 
per Fruit for Variable and Constant Lengths of Peduncle. 


| Relationship between Length of Fruit and | Correlation 


Total Ovules per Pod | 
| 


| 
| 
| For Variable Length of Peduncle ie 527 
For Constant Length of Peduncle a “470 
| Difference ... wae 057 


Total Seeds per Pod 


For Variable Length of Peduncle ee ‘576 
| For Constant Length of Peduncle wes ‘517 
| Difference... ies 059 


The differences show that there is a small amount of the relationship between 
the length of the fruit aud the number of ovules or seeds which it produces which 
is due to the correlation of both of the characters of the associated pair with 
peduncle length. But this influence is comparatively small. 


A question of very real interest is that concerning the relative magnitude cf 
the correlation between the length of the fruit and the number of ovules and the 
length of the fruit and the number of seeds. 


The results may be extracted from 
Table 7 for comparison in Table 11. 


While the correlation for length and seeds is the higher in both series, the 
excess above the relationship for length and ovules cannot be given much weight. 
I was at first somewhat surprised at this result, for I had rather expected that the 
length of a soft fruit of this kind was much more dependent upon the number of 
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seeds developing than upon the number of ovules formed. We are dealing, how- 
ever, with a form in which the correlation between the number of ovules formed 


TABLE 11. 


Comparison of Correlation for Fruit Length and Number of Ovules 
and Fruit Length and Number of Seeds. 








Relationship Correlation in 1906 | Correlation in 1907 
= Beene 
Length and Ovules ane He 527+ °015 656+ 019 
| Length and Seeds an we ‘576+ 014 688 + °017 
bn So tee 
| Difference ae 049 + 021 | 032 + 026 


ere ts | 


and the number of seeds developing is high. Under such circumstances these 
results would be expected. The problem should be worked out on some form in 
which there is only a moderate correlation between the number of ovules and 
number of seeds. 


5. The Relationship between the Number of Ovules Formed and the 
Number of Seeds Developing. 


Our knowledge of the physiology of seed production is very limited. We know 
practically nothing of why one ovule in a locule develops into a mature seed and 
another beside it fails to develop. General explanations, such as “lack of 
nutrition,” “failure to be fertilized,” etc., are thoroughly unsatisfactory. Experi- 
mental evidence would be the most desirable, but until it is forthcoming I think 
we shall have to depend upon the biometric analysis of large bodies of data to 
enable us to get some insight into the problems. 


TABLE 12. 


Correlation for Fertility Characters in Sanguinaria. 


: : Correlation and Probable Error 
Relationship between Number of Ovules 


Formed and Number of Seeds — —~—— 


ns 1906 1907 Difference 
For First Placenta... ...... | 790140080 -7998+-0121 - 
| For Second Placenta ae ... | ‘8037+ °0076 *7905 + 0127 — 
For First and Second Placenta ... ‘7970 + 0055 ‘7951+ °0088 —-0019+ "0104 
For the Fruit ae a sae *8141 +0072 8106+ 0116 | — 0035 + 0136 
! 


The first question which suggests itself for investigation is whether an ovule 
formed in a pod producing a large number of ovules is more, or less, likely to 
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Total Seeds per Fruit. 


23 25 27 29 «3! 33 35 37-0 39 


41 


43 











4 


4. 


4. n rn rn 4 4 inn 








Diagram III. Regression of total seeds per fruit on total ovules per fruit. (Means depending 
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develop than one which is produced in a fruit with a smaller number of ovules. 
This will “explain” nothing when we arrive at the answer to this question. If we 
find that it has a better chance of developing in a fruit of some particular type we 
shall not know why it has this advantage over an ovule formed in a different type 
of pod. But we shall have definitely isolated one of the factors—or complexes of 
factors—which have an influence upon the development of an ovule into a seed 


and shall have quantitatively determined the amount of influence which it 
wields. 


The ordinary coefficients of correlation between the number of ovules and the 
number of seeds developing per pod are shown in Table 12. 

Two points in this table are noteworthy. First, the correlations are high. 
Second, the agreement between the two series of material is very close. The 


constants for the two collections differ by less than the probable error of their 
difference. 


The equations to the regression straight lines for total ovules and total seeds 
per fruit are :— 
For 1906 series, y= 2°808 + ‘6772, 
For 1907 series, y= — ‘741 + °7192. 
The comparison of » with the coefficients of correlation and the &/E; of . 
Blakeman’s test for the linearity of regression are given in Table 13. 


TABLE 13. 


Tests for Linearity of Regression in Correlation between Total 
Ovules and Total Seeds. 





Constant 1906 Series 1907 Series 
| 
Coefficient of Correlation, 7 eae *8141 + 0072 *8106 + 0116 } 
Correlation Ratio, 7 em aad 8317 + 0066 "8420 + -0098 
oo 2 beara 02896 “05199 
Blakeman’s Criterion, &/Z¢ ee 4-028 3°438 


I am rather surprised to find &/#; so high as it is in these two cases. The 
indications are that the divergence of the means from a straight line is rather 
larger than one would expect to be accounted for by the errors of sampling. 
From a graph, however, there is no clear indication that any curve would give any 
better fit than the straight line. Certainly the data in hand do not justify any 
further analysis along these lines. The two constants are so nearly identical that 
for any practical purpose it is immaterial which is employed to describe the 
relationship between number of ovules and number of seeds per fruit. 


While the coefficient of correlation shows the degree of interdependence of 
number of ovules and seeds it does not yield one other item of information which 
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is essential to our understanding of fertility. This is a knowledge of the influence 
of the number of ovules per fruit upon the chance of an individual ovule 
developing into a seed. This is quite a different problem from that of the 


relationship between the total number of ovules per pod and the total number of 
seeds developing. 


As pointed out in an earlier paper*, the constant which we need is the corre- 
lation between the number of ovules and the deviation of the number of seeds 
from their probable number. The coefficients calculated by the method described 
in that paper are given in Table 14. 


TABLE 14. 


Correlation between Number of Ovules and Deviation of Number of Seeds from 


the Probable Number on the Assumption of the same Proportional Fecundity 
throughout the Population. 


| Correlation and Probable Error 
| Relationship between Ovules | __ 


| and Seeds | 
| 1906 1907 | Difference 
- i e | ee cS 
| For both Placentae... —'178+ ‘015 “038 + °024 +°216 + 028 
| For the Fruit ... sit — ‘191+ ‘021 051 + 034 | + °241+ 039 


: | 


These results show how important it is to have more than one series of material 
upon which to base conclusions when the correlations are of a low order of 
magnitude. Considering only the 1906 series we see constants with a negative 
sign and about ten times their probable error. It would seem from these results 
alone that as the number of ovules per placenta or per fruit increased the capacity 
of the fruit for maturing its ovules into seeds decreased. But when the 1907 
series is taken into consideration also we note that the correlations are hardly 
twice their probable errors but with the positive sign. Of course it may be 
possible that a larger sampling in the 1907 plants would have given a sensible 
negative correlation. But all that we can conclude is that while there may be 
a relationship between the number of ovules per fruit and the capacity of the fruit 


for maturing its seeds there is no very satisfactory argument for such a theory in 
our present collections. 


These conclusions agree with the results announced for one sample of 10,000 
pods of Cercis given as an illustration in the paper on the method of determining 
the relationship between ovules and seeds, but do not agree with the results 


for the general sample of Robinia. The factors underlying differences of this kind 
deserve investigation. 


* Harris, J. Arthur, Biometrika, Vol. v1. pp. 438—443, 1909. 
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6. The Correlation between the two Placentae of the Fruit. 


The correlations between organs of the two sides are now available for a 
considerable number of bilaterally-symmetrical animal organisms but practically 
nothing has as yet been put into print concerning the similarity of the two sides 
of bilaterally symmetrical plant organs. The fruit of Sanguinaria furnishes a very 
suitable object for the determination of such relationships. 

Since the two placentae are quite undifferentiated the correlations may be 
determined from symmetrical tables. Our tables as published have not been 
rendered symmetrical. Anyone who desires may determine the constants from 
them as they stand. 1 have worked by both methods and find no sensible 
difference in results. The coefficients for symmetrical tables can easily be calculated 
from them by use of proper formulae*. For the constants see Table 15. 


TABLE 15. 


Organic Correlations for Fruits of Sanguinaria. 





Correlation and Probable Error 
Characters of the two Placentae 


| ; Enc oe eee 


| 
| 1906 | 1907 | Difference | 
| 
| | | 
Ovules of “First” Placenta and Ovules of | | | 
“Second” Placenta =H ers ... | 9239+ -0031 | -8868 + -0072 |—-0371 + -0078 | 
| Seeds of “First” Placenta and Seeds of 
“ Second ” Placenta ‘ | -8014+ 0076 [84434 0007 + 0429 + 0123 | 


Aborted Ovules on “First” Placenta and | 
Aborted Ovules on “Second” Placenta | *6793+°0115 | -7070+°0169 + °0277 + 0204 
| Ovules on “ First” Placenta and Seeds Deve- 
| loping on “Second” Placenta... ... | °7356+°0098 | °7191+ 0163 ~ 0165 + 0193 





The equation of the regression straight line for the relationship for ovules is 
y =1:1384 + 9239 a. 
A graphic representation of the slope and the empirical means shows that except 
at the ends, where the observations are few, the observed and the theoretical means 
show a close agreement. We also note :— 
Ovules of first and ovules of second placenta, 
r= ‘9239 +-°0031, 
n= 9229 +0032, 
n—r=—-0010. 
Seeds of first and seeds of second placenta, 
r= ‘8014+ :0076, 
n= ‘8015 + ‘0076, 
n—-r= ‘0001. 


* See R. Pearl, Biometrika, Vol. v. pp. 250, 251. 
Biometrika vii 42 
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The agreement is remarkably close. The condition r>7 for the ovules is due 
to the use of Sheppard’s correction in the calculation of r. Without Sheppard’s 
correction for the standard deviations we find r=*9203, where the theoretical 
condition 7>~r is fulfilled. For the direct correlation for both ovules and seeds 
developing the differences in the constants for the two seasons are about four times 
their probable errors. But considering the possibilities of error in the sampling of 
the habitat in the two years I would not attach much significance to them as 
indicating real biological differences in the two years. The other two correlations 
differ in the two years by an amount of about the magnitude of its probable error. 


The significance of these constants seems to me to be twofold: (a) descriptive, 


(b) as a means of gaining a somewhat deeper insight into the physiology of seed 
development. 


The constant showing the degree of interdependence between the number of 
ovules on the two placentae taken in connection with the means and standard 
deviations, is a terse description of these features in the fruits of a given popula- 
tion. As it stands it is interesting as showing the high degree of perfection in the 
morphogenetic process giving rise to the ovary*. After studying a considerable 
mass of reduced data, some of which I hope to publish before long, it seems to me 
that a comparative study of these interplacental or interlocular correlation coeffi- 
cients presents some problems of considerable interest. 


These organic correlation coefficients seem to me to illustrate very well the way 
in which statistical methods may be applied to certain physiological problems, 


The direct correlation for number of ovules per placenta is higher than that 
for either number of seeds developing or number of abortive ovules. Since the 
ovules are uninfluenced by the accidents of nutrition or fertilization which affect 
the number of seeds or the number of aborted ovules, the correlation between the 
number of ovules on the two placentae is quite independent of that for the other 
fertility characters; but the converse is not at all true. 


We may ask to what extent the correlation between the number of ovules per 
placenta influences the correlation for number of seeds per placenta. The number 
of seeds developing per placenta is closely correlated with the number of ovules, 
hence a correlation between the number of seeds on the first and on the second 
placenta would necessarily be found even though there were no physiological 
relationship whatever between the development of the seed on the two sides of the 
fruit. Our object now is to remove the influence of the correlation for number of 
ovules on first and on second placenta from the correlation for seeds developing. 
If there is a physiological interdependence between the two placentae of the fruit 
depending on some factor other than the number of ovules present, we should 
expect to find a significant correlation between the number of seeds developing 
after the contribution of the ovules has been removed. 

* Since the tables are symmetrical the slope of the regression line is the same as the coefficient of 
correlation. 
422 
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Again we have recourse to the partial correlation coefficient as in the investi- 
gation of the influence of peduncle length upon the relationship between the length 
of the fruit and its fertility characters. Here the more complex formula for four 
variables must be used. 


The partial correlation, r,,,, for constant values of 0, and 0, is given by 
the formula* 


T,.9.= 18,8 (1 = o,0,") — 10,8, 70,8. — 10,8, 0,55 > To,0; (70,8, 048, + 10,8 To.8,) 
ae) =e ee = Se a ee P eae a? 
v1 =e 0,03" it 048," a Tos, + 21 0,0,""0,8,7 a6,) v(1 =~ Vo0g — Vos, = Tos, + 2770.0, 7'0,8,1'0,8,) 








where 0, and 0,, s, and s, are the ovules and seeds on the first aud second placenta. 
But since our tables have been rendered symmetrical 0,=0,, s,=s, and we may 
write 
py atte (Lae) — Breton tre (Pad +P) 
1 = 10? — Nog? — V8 + 2 ow NosYo'g 
where the correlations without dashes are those for the same placenta and those 
with dashes are for the relationships between different placentae. 


Substituting constants for these formulae we find :— 
For 1906, ryy ='590, 
For 1907, ryy ='714. 


Remembering that our gross correlations were 7 =°801 for 1906 and r= ‘844 for 
1907 :* is clear that :— 


(a) A certain amount of the correlation between the number of seeds 


developing is due to the correlation between the number of ovules formed on the 
two placentae. 


(b) The correlation is mainly due, however, to the physiological factors upon 
which the development of ovules into seeds depends. 


Under the term physiological as used here are comprehended, (a) the ecological 
factors which determine whether an ovule shall receive a sperm, and the availability 
of food material and other requisites for growth, (b) the innate vigor or other 
physiological characters of the individual which determine whether a fertilized 
ovul* shall develop into a seed. 


The foregoing analysis shows, I believe, that the main bulk of the correlation 
between the number of seeds on the two placentae is due to a complex of ecological 
and physiological causes, and is not merely a necessary statistical result of the 
correlation for ovules. Some of the biological factors may have an influence on 
both the number of ovules formed and the proportion of these ovules which shall 
develop into seeds. Experimental data in quantity and form for statistical treat- 
ment are desirable. 


* Pearson, K., Phil. Trans. A, Vol. cc. p. 31, 1902. 
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IV. RECAPITULATION. 


The present paper embodies, the results of a quantitative study of the in- 
florescence and fruit of Sanguinaria with special reference to morphological and 
physiological problems. The results presented are chiefly of value as descriptive 
quantitative data. The constants will gain in significance as others from organs of 


distinct morphological type are laid beside them for comparison. 


The following 
points may be noted. 


1. The variation and correlation constants for two collections of Sanguinaria 


made in the same habitat for two successive years show very close agreement for 
some characters and significant divergence for others. The significant differences 
do not necessarily indicate that the populations in the two years are really different 
for the material was not collected in a manner to secure the same proportional 
sampling of subhabitats in the two seasons, and these may differ enough among 
themselves to introduce a slight heterogeneity into the material. These results 
furnish another illustration of the difficulty of establishing “place constants” for 
an organism easily affected by environment. On the whole, the two series may be 
regarded as very satisfactorily similar as material for a morphological and physio- 
logical investigation. 

2. The length of the peduncle and the length of the fruit are to some extent 
interdependent, the magnitude of the relationship being measured by a coefficient 
of correlation of about ‘350. Number of ovules formed, number of seeds developing 
and number of aborted ovules per fruit are also correlated with peduncle length. 
For both ovules and seeds developing the correlation is about 300—-350, the 
intensity of the relationship being slightly but not significantly higher for length 


of fruiting stalk and number of seeds developing than for length of fruiting stalk 
and number of ovules formed. 


Since our correlation coefficient is purely descriptive, we cannot say that the 
length of the peduncle has any influence upon the length of the fruit or upon its 
fertility. All that we can do at present is to measure the degree of interdependence 
of these pairs of characters. Possibly neither has any real influence upon the 
development of the other and their measurable interdependence is due merely to 
the age or physiological vigor of the plants which produced them. Interdependence 
of their magnitude would then be merely the result of their mutual dependence 
upon other factors. Perhaps questions of this sort can be solved only by appropriate 
experiments, but we sorely need quantitative data of the kind presented here for 
numerous species to guide us in the intelligent planning of experiments. 


3. It is not surprising to find that length of fruit and number of ovules formed 
and length of fruit and number of seeds developing are interdependent, but it is 
important to measure the degree of correlation between them in such a way that 
comparison with other forms will be possible. As indicated by our material the 
correlations for length and ovules or seeds developing lie between “500 and -700. 
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The correlation between the length of fruiting stalk and both length of fruit 
and fertility characters of the fruit would necessarily bring about a statistical 
correlation between length of the fruit and its fertility characters, whether there is 
any other biological reason for such a correlation or not. Calculation for the 
relationship between length of fruit and its fertility for constant stem length shows 
that the removal of the influence of this latter character reduces the correlation 
but slightly. 

Clearly, then, the correlation between length and fertility of fruit is due 
mainly to causes other than the length of peduncle. 

In both series the correlation between the length of the fruit and the number 
of seeds developing is slightly but hardly significantly higher than that between 
the length of the fruit and the number of ovules formed. 


4. The correlation between the number of ovules formed and the number of 
seeds developing per placenta or per fruit is high, the constants falling immediately 
above or below 800. This is obviously a necessary condition when the ratio 
seeds/ovules is high, as it is in this case, and the quantitative statement of the 
degree of the relationship is chiefly of interest in comparison with similar constants 
from other material. It tells us nothing concerning the influence of the number of 
ovules formed per fruit upon the probability of an individual ovule developing into 
a seed. In other words, the coefficient of correlation between number of ovules 
formed and number of seeds developing is a measure of the influence of the number 
of ovules upon the absolute fertility as measured by the number of seeds developing, 
but not upon relative fertility. 


A suitable constant for the measurement of this latter relationship indicates 
that in the 1906 series as the number of ovules per fruit increases the capacity of 
the fruit for maturing its ovules into seeds decreases slightly, while in the 1907 
collection there seems to be no significant influence of the number of ovules per 
fruit upon the chances of an ovule developing into a seed. 

It has been suggested to me that the fruits in which the larger number of 
ovules are formed are the more vigorous, and that in consequence they will be able 
to develop a higher per cent. of their ovules into seeds. The discordant results 
secured in this study as well as those announced earlier for Cercis and Robinia 
indicate that extreme caution must be used in asserting that the number of ovules 
formed in a fruit has anything to do with the proportion of them which shall 
develop into seeds. 


5. The correlations between the two placentae are all high, the direct correla- 
tion for the number of aborted ovules being actually the lowest, although the 
cross correlation for ovules of the “first” placenta and seeds developing on the 
“second ” placenta is perhaps not significantly higher. The values for the direct 
correlations for seeds developing are ‘80 and ‘84 and those for ovules ‘89 and ‘92 in 
the two series. These constants show that there is a high degree of bilateral 
symmetry in the fruit of Sanguinaria. 
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Since the number of ovules formed and the number of seeds developing on a 
placenta are correlated, the correlation between the number of ovules per placenta 
must bring about a correlation between the number of seeds developing on the 
two placentae. Evidence is adduced to show that the correlation for seeds is not 
entirely due to the correlation for ovules, but that the larger part of the inter- 


relationship for the seeds developing must be due to ecological or physiological 
factors. 


London, December 7, 1909. 


V. APPENDIX OF CORRELATION TABLES OF DATA, 


The tables for the relationships between the two placentae are not rendered 
symmetrical except for the cross correlation for ovules of “first” placenta and 
seeds of “second” placenta where it is obviously necessary to use each placenta 
once as a “ first” and once as a “second” placenta or publish two tables. 


It is unnecessary to publish tables for the following four relationships :—length 
of peduncle and total ovules, length of peduncle and total seeds, length of fruit 
and total ovules, length of fruit and total seeds. The product moments for these 
can be easily derived from those for ovules and seeds per placenta by taking the 
origin at 0 in calculating the first moments of the arrays of number of ovules or 
number of seeds per placenta. Since both placentae of a fruit were entered under 
the same grade of peduncle length or fruit length the first moment for the array 
gives at once the total number of ovules or seeds formed by that grade of peduncle 
or fruit. This is the same for the number per placenta or the total number per 
fruit. The origin of the other character—peduncle length or fruit length in the 
present case—may then be taken wherever we please, and the product moment 
calculated at once by multiplying up the first moments of the arrays by the 
deviations of the classes of the other character from their origin. The means of 
arrays for tests of linearity of regression may be easily obtained. The standard 
deviations for total ovules and total seeds per fruit are available from other tables. 


The correlation tables for number of ovules failing per placenta and per fruit 
are also omitted, for if their standard deviations be known the more important 
correlations can be derived from the tables for ovules formed and seeds developing. 
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TABLE VIIL., 1907. 


. 
Number of Ovules on First Placenta and Number of Ovules on Second Placenta. 


Ovules on Second Placenta. 
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TABLE IX.,, 1906. 


Morphology of the Fruit of Sanguinaria Canadensis 


Number of Seeds Developing on First Placenta and Number of Seeds Developing on Second Placenta. 


Seeds Developing on Second Placenta. 
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TABLE X., 1907. 


Number of Aborted Ovules on First Placenta and Number of Aborted Ovules 
on Second Placenta. 


Aborted Ovules on Second Placenta. 
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AN OBJECTIVE STUDY OF MATHEMATICAL 
INTELLIGENCE*. 


By WILLIAM BROWN, M.A., Oxford; Lecturer on Psychology, King’s 
College, London. 


I. Introductory. 


OnE of the most pressing needs of the psychology of the present day is a 
theory of reasoning, full, concrete, and capable of practical application. If we turn 
to the chapters on reasoning in any of the standard text-books, we find a vagueness 
and diffidence strongly contrasting with the wealth of detail and confident generali- 
sation prominent in the chapters on lower mental processes+. This state of affairs 
is due in part, doubtless, to the desire not to encroach on the territory of the 
logician, in part to the complexity of the processes concerned,—most of all, 
however, to the insufficiency of the method employed, viz. that of unaided intro- 
spection. That this is so, is clear from the fact that the psychology of thought 
has even a longer history than that of most of the simpler mental processes, and 
yet has made far less progress. 


The unsatisfactory character of the generally-accepted introspective account 
has been recently brought into prominence by considerations, separately conclusive, 
yet quite independent of one another. These originate from the three distinct 
realms of normal psychology, psychopathology, and applied psychology of education, 
respectively. 

(1) In norma) psychology the evidence is derived from certain experiments, 
among which those of Thorndike, Woodworth, Winch, Watt, and Ach are of chief 
importance. They all point to the fact that mental imagery, or, in more general 
terms, sensory content of any kind, is not an indispensable factor in cognitive 
process, and, even when present, is of small efficiency. Thorndiket compared 


* Read before the British Psychological Society, March 13, 1909. Considerable changes have since 
been made in the methods of evaluating the results. 

+ A recent notable exception is J. E. Miller’s The Psychology of Thinking, Macmillan, 1909. 

t¢ E. L. Thorndike: “On the Function of Visual Images,” Journal of Philosophy, Psychology, 
and Scientific Methods, Vol. 1v. No. 12, p. 324, June 6, 1907. 
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good and bad visualisers with one another with respect to memory of facts of visual 
perception, and found no superiority in the case of the former class. One of 
Woodworth’s experiments* was to present to the subject « relation by means of 
two terms, a and b, between which it held, and a third term, c, for which a fourth 
term, #, was to be found bearing the same relation to c that b did toa. The terms 
were cards of different colours and shapes, and also significant words, such as :— 


London : England :: Paris: a, 
Mice : cats :1 WOrms : @. 


The subjects’ introspections showed that, although in difficult cases the 
relation may come to consciousness in the form of a name or some sort of mental 
imagery, in some cases the relation is felt without a name or mental image of 


any kind, and in easy cases the answer comes immediately, with very little 
consciousness at all. 


Wincht+ tested the memory of school-children for drawings under conditions 
in which mental “imaging” immediately after the period of exposure was either 
enjoined or forbidden. No superiority in memorising-power was shown by the 
children under the former of the two conditions. 


Watt+ and Ach§ reached their conclusions in the course of experiments on 
reaction-times. It was found by both observers that the reaction was determined 
in the mind of the subject by the general purpose or known nature of the task set 
by the experimenter, often at a very low degree of consciousness. According to 
Ach, image and stimulus are both secondary to the purpose or task. 


(2) In the realm of psychopathology there is overwhelming evidence that loss 
of mental functioning may take other forms than those of loss of sensation or of 
mental imagery. In particular, the researches of Marie|| and his school{ into the 


problem of aphasia seem to point to a priority of thought over sensory content of 
any kind. 


(3) The practical uselessness for education of the prevalent psychological 
theory of reasoning is becoming more and more apparent in educational circles. 
Outside the walls of the infant school, education deals primarily with the training 
and development of reasoning as such, but this cannot be considered, after the 
manner of the faculty psychologists, as a general capacity functioning independently 
of the material on and through which it works. Each school subject has potenti- 


* R. 8. Woodworth : ‘‘ The Consciousness of Relation,” Essays Philosophical and Psychological in 
Honour of William James, 1908, pp. 489—492. 

+ W. H. Winch: ‘‘The Function of Images,” Journal of Philosophy, Psychology, and Scientific 
Methods, Vol. v. No. 13, pp. 337 ff., June 18, 1908. 

+ H. J. Watt: Laperimentelle Beitriige zu einer Theorie des Denkens, 1905. 

§ N. Ach: Ueber die Willenstitigkeit und das Denken, Gottingen, 1905. 

|| Pierre Marie: Semaine Medicale, May 23, Oct. 17, and Nov. 28, 1906. Pierre Marie: ‘‘ Sur la 
Fonction du Langage,” Revue de Philosophie, 1907. 

“| Francois Montier: L’Aphasie de Broca: Paris, Steinheil, 1908. L. Dugas: “ Une Théorie Nouvelle 
de l’Aphasie,” Journal de Psychologie, Sept. Oct., 1908. 
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ally a psychology as well as a logic; but this potentiality has not as yet been 
actualised by the psychologists, the reason being, as is shown by the cases 
previously mentioned, that introspection alone is insufficient to give us the 
requisite data*. 


Apart from the experiments mentioned earlier in this paper, most of the 
experimental work on the psychology of reasoning has taken the form of a study 
ot “tests” of elementary physical and mental processes accepted problematically as 
symptoms of degrees of intelligence. The history of this movement is already 
recorded with sufficient fulness in Meumann’s Experimentelle Pddagogik, Bd. 1. 
(Leipzig, 1907). Hitherto the results obtained have been either conflicting or 
negative. In particular, the hypothesis, put forward by one investigator, that 
general intelligence and general power of sensory discrimination are identical, has 
not been confirmed by recent, work. 


Mention should, however, be made of the fact that the problem of reasoning 
has been attacked more directly in a recent research by Willis L. Gard, entitled 
“A Preliminary Study of the Psychology of Reasoning,” communicated to the 
American Journal of Psychologyt by Prof. Sanford. The experiments took the 
form of getting the subject to solve simple mathematical puzzles, e.g. to fill up 
gaps in partially worked out division and multiplication sums. It seems somewhat 
doubtful whether puzzles are ever a good test of intelligence, since they really 
involve a good deal of specialisation of interest and ability. 


Il. Historical. 


Turning now to the subject-proper of this paper, viz. the study of mathematical 
intelligence, we find that earlier experimental work is limited to the analysis of 
the mental processes involved in the simplest forms of arithmetical operation,— 
counting, adding, subtracting, ete. 


(a) Counting. As regards counting, experiments have been worked to decide 
the rival claims of the intuitional method (of Pestalozzi) and the counting method 
in the teaching of number. These two methods are based on the spatial and 
temporal groupings of units, respectively, and the experiments therefore fall 
naturally into two classes. The first includes the experiments of Cattell}, Lay§, 
Warren||, Messenger‘i and Helene Nanu**, who all worked with the tachistoscope, 
exposing to the subjects’ view lines or dots, varying in number and arrangement, 
for very short lengths of time. Experiments belonging to the second class are 

* See William Brown ; ‘“‘Educational Psychology in the Secondary School,” Journal of Philosophy, 

Psychology, and Scientific Methods, Vol. vit. No. 1, Jan. 6, 1910. 

+ Vol. xvuit. Oct. 1907. 

t+ Cattell: ‘‘ Ueber die Zeit des Erkennens u.s.w.’’ Philos. Studien, m1. 1886. 

§ Lay: Fiihrer durch den ersten Rechenunterricht, 1 Aufi., 1898. 

|| Warren : Princetown Contributions to Psychology, 1. 1898, Vol. 3. 

‘| Messenger: “ The Perception of Number,” Psychol. Review, Monograph Supplement, v. 1903. 
** Helene A. Nanu: ‘* Zur Psychologie der Zahlauffassung,” 


Wiirzburger Dissertation, 1904. 
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those of Dietze*, who employed a metronome and asked the subject to compare 
series of beats of different lengths with one another (without counting). His 
purpose was to determine how many beats could be correctly estimated without 
counting. The result was found to vary with the rapidity of the beats, the most 
favourable interval between the beats being ‘2 to ‘3 sec. 


(b) Simple Arithmetical Processes. The psychology of addition, subtraction, 
and other simple arithmetical operations has been repeatedly studied by intro- 
spection, either casual or controlled. An example of the former is to be found in 
Binet’s Psychologie des Grands Calculateurs et Joueurs @’ Echecs, and of the latter 
in an article in Vol. xvii. of the Am. J. P. (1906). 


(c) Higher Mathematical Processes. As regards the psychological analysis of 
higher mathematical processes, very little work has thus far been done. In almost 
all cases the data have been school and college marks or class-lists. The method 
employed has been generally that of correlation, but the coefficients of correlation 
have been worked out between mathematical ability as a whole and the equally 
complex mental capacities involved in the other main subjects of the school or 
college curriculum, e.g. Classics, French, English, ete. No instance of analysis of 
the inner relations of mathematical intelligence is known to the writer. 


III. Collection and Analysis of Data. 


Towards the close of the year 1908, the writer examined in mathematics a 
group of 83 boys belonging to the middle forms of the classical side of an 
English public school. The group consisted of five sets or forms, viz. U. V4, 
U. V®, L. VI, U. VI and VII; but all the boys were examined on the same 
three papers—Geometry, Arithmetic, and Algebra—and they had all been working 
along the same lines and in the same environment. The examination papers were 
set by one of the masters, and the writer was expected to return the boys’ answers, 
marked, within the week. Although this arrangement meant that the work had 
to be done at high pressure, it was on the whole a fortunate one so far as the 
research was concerned, since it almost certainly rendered the standard of marking 
much more steady than the latter would have been if the marking had extended 
over a longer time. 





The papers were first marked according to ordinary school standards for the 
sake of the school examination, and then according to a differential system of 


* Dietze : ‘‘Ueber den Bewusstseinsumfang,” Philos, Studien, 1. 1885. 

+ C. E. Browne: ‘‘The Psychology of the Simple Arithmetical Processes,” dm. J. P. Vol. xvi. Jan. 
1906 ; see also Frank D. Mitchell: ‘‘ Mathematical Prodigies,” Am. J. P. Vol. xvu1. Jan. 1907. 

Recently I have myself had excellent opportunities of studying these arithmetical processes intro- 
spectively. The processes seem to go on almost mechanically, but are yet controlled from moment to 
moment by auditory and kinaesthetic imagery. When any distraction occurs, either from without or 
from within, visual imagery is immediately called up, and this is sufficiently stable to allow me to pause 
for amoment and “ collect my thoughts.” The various forms of imagery present seem to act as controls 
of the thought, but otherwise to be quite inactive, the ‘‘driving power” coming from the standing 
purpose to add, subtract, ete. (Cf. supra, p. 353, Ach’s results.) 
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marking based on an analysis of the intellectual processes involved in the answering 
of the questions as far as they could be detected by introspection ; according to 
this system, the maximum mark for any one process in any one answer was 10. 
(For samples of questions, analyses, and actual marks, see Appendices I. and IIT.) 


It was decided that the number of boys examined was not large enough to 
sustain a mathematical superstructure built on so detailed an analysis. The 
analyses were therefore subsumed under nine principal headings, viz., A, B, C, D, E, 
F, G, H, I, distributed as follows :— 


Geometry. 


A. Memory of definitions and general principles (e.g. principle of superposition ) 

in Geometry. 
[1 (iii) + 3 Gii) +5 (1) + 6G) +8 (+9 ()*. 
B. Memory of constructions in Geometry. 
[2 (i) + 3 (11) + 4 (1) + 5 (iii) + 9 (ii)]. 
C. Memory of preceding propositions, and power of applying them. Geometry. 
[1 (ii) + 2 (iii) + 3 (i) + 4 (iii + _v) + 5 (i + iv + vi) + 6 (ii) + 7 + iv) + 8 (ii + i)). 

D. Recognition of necessity of generality in proof, and power of recognizing 

general relations in a particular case. Geometry. 


[1 (itiv) +3 (iv) +4 (iv + vi) +5 (v) + 6 (iii) + 8 (iv)]. 


Arithmetic. 
E. Accuracy in Arithmetic. 


[1 (ii) + 2 (iil) + 3 (iv) + 4 (iii) + 5 (iv) + 6 (ii) + 7 (ii) + 8 (iil) + 9 (ii) + 10 (iv) 
+11] (ii) + 12 (aii) + 13 (ii)]. 

F. General memory of rules and power of applying them. Arithmetic. 
(1 (i) + 2 (i+ ii) + 3 (ii) + 4(1 + 1) + 6 (1) + 8 (0 + i) +:10 (1 + i) +_ 11 (i) +: 12 + ii) 


+ 13 (i)]. 
G. Power of doing sums in percentage and proportion. Arithmetic. 


[7 (i) +9 (i) +12(i +i1)}. 
Algebra. 
H. Accuracy in Algebra. 
[1 (iit) + 2 (iii) + 3 (ai) + 4 (iv) + 5 (ii) + 6 (iii) + 7 (iv) + 8 (iii) + 9 (ii) + 10 (iii). 
I. General memory of rules and power of applying them. Algebra. 


[1 G+ ii) + 2 (ii) + 3.(i) + 4 (ii + iii) + 5(i) + 6 ii) + 7 + ii + iii) + 8D). 


* See Appendix I for meaning of these numbers. 
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These marks, together with the ages of the boys and their total marks for 
Geometry, Arithmetic, and Algebra, gave thirteen series of measurements and also 
thirteen corresponding series of “ranks” or orders. Correlation coefficients were 
now evaluated, using both actual measurements and also ranks. In the former 
case, Pearson’s full method was used, the measurements were grouped, correlation 


tables were drawn up, and the product-moment formula, rz, = S(zy) was 


NOx y” 
applied*. In the case of ranks, two methods were used, (1) the product-moment 
, 2 
formula, which in the case of ranks reduces to 1 — Man where d= difference 


of corresponding ranks in the two series, was applied. This gave what 
Pearson+ calls p, or “rank-correlation,” and the result was then converted into 


“true variate correlation” by the formula r= 2 sin (Fe); (2) the formula 

6S (9) Lez : * 
R=1 ae ay where gy=gain in rank of second series on first{, was applied, 
and again the true variate correlation was obtained by Pearson’s formula 


r= 2 cos ¢ (1 — R)—1§ 


The above-mentioned two formulae giving r in terms of p and R respectively 
were deduced by Pearson on the assumption of “normal” or Gaussian distribution. 
If the correction is not made, in other words if ranks are themselves taken as 
measurements, the distribution obtained is a rectangle,—not obviously a reasonable 
assumption. 

The coefficients (crude values) obtained in these three ways are given in the 
following table :— 














=" 9) (marke) | r=2sin (j ?) [ranks] | r=2cos ~ (1— R)-1[ranks] 
| NG | 6 { 3 
as Fal eles | 
ee | } ie 
| Arith Ale .. | 9 -794-03— 65 | 65 | 
Y “a ° “Re | “AO 
| Geom. Alg. aod 66 + °04 | 63 59 
| Geom. Arith. ... | 58 + 05 61 “57 
cD oi) 81+ °02 | 77 | 74 
EH 69 + 04 ‘5D | 57 
CG ‘b9+ 05 ‘BG 53 
FI 49 + 06 49 | ‘51 
GI | 49 + 06 54 ‘BS 
DG | “44+ -06 43 “46 
| BG 26+ 07 24 "24 
| 
| 








* For a specimen correlation table, see Appendix II. 

+ Karl Pearson: ‘On Further Methods of Determining Correlation,” Drapers’ Company Research 
Memoirs, Biometric Series, tv., London, Dulau & Co., 1907, pp. 11 and 18. 

t C. Spearman: ‘‘Foot-rule’ for measuring Correlation,” Brit. Journ. of Psychology, Vol. 1. 
Part I, July, 1906, pp. 100—104. 

§ Karl Pearson: Op. cit. p. 17. Spearman suggested in his ‘‘Foot-rule” article the formula 


r=sin G R) , Which he attempts to justify on merely empirical grounds. A full criticism of his work 


on “ranks” will be found in Pearson’s article above-mentioned. 
Biometrika vir 46 
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The discrepancies between mark-values and grade- (or adjusted rank) values 
are due to deviations from normal distribution*. This is empirically obvious 
from the fact that the discrepancies are larger the larger the deviations from 
normal distribution. This is also the case with the following additional coefficients, 
calculated by only two of the methods :— 





| 











| 
S (xy) cg ' 
r= saa [marks] r=2 cos 3 (1- R) -1 [ranks] | 
| 
oc eee |. | 
Arith. G | 78+ °03 ‘75 | 
Alg. Gt 71+-04 | 57 
Geom. G “67+ °04 “60 
AC ‘78+ 03 81 
AD | 65+-05 | 60 | 
FG | 64405 63 
BC 60 + 05 | ‘68 | 
Cl 55 + 06 ‘57 
CF | ‘51 + 06 50 
AB 49 + ‘06 “65 
BD ‘37 +°07 | “29 
| | 





Thirty-three other coefficients were worked out by the formula 


r=2cos 5 (1—R)-1 


only. They are as follows :— 


Geom. A °81 HI "92 AF “46 
~ & 6 EF ‘82 | DE 46 
ot AO Se mm 46 | se a 
2 D e | AH ‘61 DI 43 

Arith C 84 | OH 87 | AG «4 
oy Dos | EI 57 | DH 40 
. se FH ‘55 BF 28 
~ - GH 53 BH 26 

Alg. C ‘59 DF 49 BE 23 
—— AI “47 BI 22 
ae eo CE 47 
ee oe ae 


All the above coefficients are “crude,” i.e. uncorrected for the effect of extraneous 
conditions. In the present case the disturbing conditions are principally difference 
of age and difference of class or form. To correct for the first, coefficients were 
evaluated between age and each of the other twelve series. For ranks 


| r= 2 cos F(R) - 1 


* Considering the smallness of the sample, and therefore the largeness of the random deviations from 
possible normality, the two formulae lead to results in strikingly close accordance and suggest normal 
distribution as being the rule for the psychical characters observed. 

+ The exceptionally large discrepancy here corresponds with the fact that the characters compared 
: Alg. 26, 16, 19, 9°5, 6-5, 
Unit of grouping, 7 marks. 


have the two most skew distributions of the series, their distributions being 
4,1, 1 (a J-curve). G 30, 26, 12, 4, 2, 9 (a U-curve). 
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the values were as follows :— 
Correlations with Age (ranks). 








Geom. 0°19* , | B O11 F 002 
Alg. 019% C 0-06 G -0-12 
Arith. 0°04 D 005 H 0-18* 
A 0°21* E —0°02 I 0°19* 
P.E. for zero correlation = 7738 = 0°09. 
V83 


The only values greater than twice the probable error, and therefore possibly 
appreciable, are those marked with a cross. The corresponding values, using marks, 
are :— 

Geom. 0°06, Alg. 023%, A 0:09, H 019%, I 0:07, 
P.E. for zero correlation = —— = 0075. 


Dery 





In the second case, therefore, only Alg.and H are affected appreciably. Owing 
to the discrepancies between marks and ranks, only the coefficients obtained from 
marks were corrected. The formula used was, of course, the partial correlation 
coefficient 7,, for constant age*, viz. 

Tyo — 1131 98 
V(1—7°)(1— 122) 
Thus, for Geom. Alg., putting Geom. 1, Alg. 2, Age 3, 
__ fates ee Cw 
V(1l=n3)(L—re?) V(1—0°)(1— 23) 973 

The second main disturbing factor was the arrangement of the boys in sets 
or forms. The boys of the higher sets might have been expected to do better in 
every part of the work than those of the lower sets, because they had covered more 
of the syllabus than the others. Instead, however, of calculating correlation 
coefficients between order of form and each of the series of measurements, the 
“correlation ratio,” », was evaluated+. In carrying out the correction, 9 is 
substituted instead of r in the partial correlation coefficient. 





T= 


T= 


The values of » obtained were the following :— 


Correlation Ratios, “ Form” and Character. 
| 


Geom. ‘71 


D ‘od 

Arith. °59 | E °-73 
Alg. "87 F ‘9 
A +59 G 65 

B 28 SO 

C ‘66 I-80 


Age ‘41 

The fact that for age is ‘41 while the correlation between age and total marks 
is nil seems to show that the effect of difference of form in the other series is due 

* See, e.g., G. Udny Yule: ‘‘On the Theory of Correlation,” Journal of the Royal Statistical 
Society, Vol. ux. Part rv. Dec. 1897, p. 833. 

+ See Karl Pearson: Biometric Series 1, Drapers’ Company Research Memoirs, pp. 9 ff. For an 
example of the method, see Appendix Il. 

46—2 
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mainly to the difference in the extent of syllabus covered in the different forms. 
This, at any rate, seems to be the most satisfactory solution of a difficulty which 
the writer himself would be the first to admit. 


The finally corrected values of the correlations are :— 


Alg. Arith. 76403 | BD 27+°07 +| ‘$FG “41+ 06 

Geom. Arith. ‘28+°07 | CD 69+°04 | BG *11+°08 

Geom. Alg. ‘18408 | Arith,G 65404 | DG 11408 | 
AB ‘42+-06 | Alg. G ‘45+°06 | CF 20+07 | 
AC 64+ 05 Geom. G 39+°07 CI 05+°08 | 
AD *31+°07 CG 28+ °07 EH *33+°07 | 
BC 57 +05 IG 00+°08 | FI 04+°08 | 


Several interesting results can be obtained from these values by applying the 
theory of multiple correlation. Thus, limiting our attention for the moment to 
Geometry, Arithmetic, and Algebra, it is possible for us to deduce the degree 
of correlation between any two of them on the assumption that the third ability is 
constant throughout the series. It is given by the formula 


Vio — Ti31 23 





= -— <== @ Ss ab >, Pp. 359. 
Ja=r 0-12) [See above, p. 359.] 


The partial correlation coefficients in the present case are :— 
Geom. Alg. —°05 +°08, Geom. Arith. ‘23+°07, Alg. Arith. ‘75 + ‘03. 
We may perhaps infer from these values that geometrical ability is only related 
to algebraical ability through the mediation of arithmetical ability*. 


Tis 


Taking now four variates, e.g. A, B, C, and D, it is, in a similar way, possible 
to deduce the degree of correlation between any two, say C and D, on the 
assumption that the other two abilities are constant throughout the series. 
Putting the numbers 1, 2, 3, 4 in the place of A, B, C, D, the partial correlation 
coefficient 
7,= — Ys (1 — M2") +x (TyT2— Ta) + Vm (Tat ie — Te 

Vv1—- Ne — Ng — To? 
= 93. 

There is thus a very close relation indeed between memory of propositions in 
geometry, and the power of recognising general relations in a particular case 
in geometry. 








+ 2p Tou Vv1- My — Tis" — Veg" + WWW sl ve 


Obviously, the method could be extended indefinitely. 

A glance at the table of corrected coefficients suggests the following additional 
results :— 

(1) the ability to do percentage and proportion sums in arithmetic is more 
closely related to essential geometrical ability than to essential algebraical ability ; 


* This result harmonises with the view that mathematical reasoners fall into two types, the so-called 
“geometrical” (or “intuitional”) and the ‘‘analytical” (or “‘logical”) types. See H. Poincaré: La 
Valeur de la Science, pp. 11—15, 

+ See Karl Pearson: “ Selection, etc.” Phil, Trans. Vol. 200 a, p. 31. 
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(2) the close relation between arithmetical ability and algebraical ability 
is due mainly to habits of accuracy common to both; 


(3) memory of preceding propositions is the ability most closely related to 
the other forms of geometrical ability in school-boys,—is, in fact, the central ability 
in school geometry. 


It would be interesting to test the results for the existence of a general 
geometrical ability or a general mathematical ability, but unfortunately the only 
formula hitherto suggested for such a purpose, viz. that given by Dr Spearman*, 


is open to very serious criticism+. The writer hopes to return to the problem in 
a future paper. 


* Krueger and Spearman : Zeitschrift filr Psychologie, Bd. 44, 1906, p. 84. 

+ See Clark Wissler: ‘‘The Spearman Correlation Formula,” Science, N. 8. Vol. xxm. No. 558, 
Sept. 8, 1905, pp. 309—311. 

William Brown : ‘Some Experimental Results in Correlation,’’ Comptes Rendus du V1™ Congrés 
International de Psychologie, Genéve, Aug. 1909. In this pamphlet mention was made of the assumptions 
upon which the applicability of Spearman’s formula was based, and an alternative proof of the formula 
based on those assumptions, contributed by Mr G. Udny Yule, was brought forward. 

The formula may be written 


2 — 
Tt. Tits 
where X and Y represent the unknown true values to be correlated, and X,, X2; Yi, Y2 are two pairs of 
observed values, vitiated more or less by errors of observation. 
Let 


typ a MEME MN GMs, 


%=£+5, yi=yta, 
%=L+52, y=yter, 
where x, y, 5, € represent deviations from means. 

The assumption involved in the formula is that the errors of measurement 61, 52, €1, €2 are un- 
correlated with each other or with zx or y. 

I suggested that the justifiability of this assumption might be tested by correlating X, ~ Xz and 
Y, ~ Y,, X; ~X2 and X,, Y; ~ Y2 and Y, which should all give a zero value for the coefficient. In this 
statement the sign of subtraction (—) should have been used instead of the difference-sign (~), and only 
the first of the three coefficients, viz. that between X,—X»2 and Y,-— Ye, should be expected to give 
a zero value. 


Thus, S {(x1— 22) (yi - Yo) 


ty _~, 2S = 
PLP VS (e—22)?S (yi — yo)? 
_ S {(5; — 52) (e1 - €2)} 
~ v/S (8; - 83)? S (€1 - 2)? 
=0, since the numerator vanishes. 

Applying this test to some of the material on which I based my Geneva pamphlet, I found :-— 

1. In the case of accuracy in bisecting and trisecting lines, where the subjects of the experiment 
were 43 adults (Group A in the paper) 

 p, — B,= 930+ 0°09 ; 
T- T, 

2. In the case of speed (S) and accuracy (A) in the addition of series of 10 single digits, where the 
subjects were 38 elementary school children, girls between the ages of 11 and 12 (Group C) 

1g, — S,, Ar— Ay = 0°35 +009. 

Thus, in each case a correlation greater than three times its probable error was found, showing that 
errors were almost certainly correlated among themselves and hence that Spearman’s formula could not 
be applied. 

In cases, however, where the correlation does work out to zero we cannot infer that errors are 
not correlated, but only that 

S (Bye1) + S (Sy€2) = S (Sre2) + S (5ge1). 

The formula may still be quite inapplicable. 











362 An Objective Study of Mathematical Intelligence 


The nature of the correlation tables, and the form of the regression curves to 
which they gave rise, seemed to indicate that the number of boys examined (83) 
was sufficiently large to give fairly reliable results. On the other hand, the 
numbers furnished by separate school forms of average size (say 20) are certainly 
too small. Thus, the correlation coefficient between mathematics and classics was 
calculated for the Mathematical Eighth Form of the same school for Christmas, 
1907, and again for Christmas, 1908. The two values were 0°20 and 0°52 
respectively, showing a discrepancy of 160°/, of the smaller value! Similarly, 
the mathematics-classics coefficient for the VIIth, U. VIth, and L. VIth forms of 
the mathematical division investigated in this paper worked out as 0°23, 0°76, and 
— 9°25 respectively. This would seem to indicate that the results of Correlational 
Psychology hitherto obtained, where small numbers of cases have almost invariably 
been used, are utterly unreliable. In such cases the ordinary formulae for 
probable errors (the proofs of which rest on the assumption of a large number 
of observations) do not apply, and consequently no clear appreciation of the 
significance of the results is possible. 


APPENDIX IL. 


A. Sample of questions in Geometry. 


1. (a) If two triangles have two angles and a side of one equal respectively 
to two angles and a side of the other, they will be equal in all respects. 


(6) ABCD is a square, perpendiculars BL and DN are drawn to any straight 
line through A. Prove that AL = DN. 


4. (a) The straight line from the centre of a circle, perpendicular to a chord, 
bisects the chord. 


(b) OP is the radius of a circle. On OP as diameter another circle is 
described. Prove that all chords of the larger circle drawn through P 
are bisected by the smaller circle. 
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B. Sample of analyses of above two Geometry questions. 
1. (a) (i) Recognition of necessity of generality in the proof. 


(ii) Memory of general truth that three angles of a triangle equal 
two right-angles, and application to this case. 


(iii) Principle of “ proof by superposition”: memory, sufficient under- 
standing, and application of it to case under consideration. 


(b) (iv) Power of recognising general relations in a particular case. 
(v) Readiness to note and make use of the more special relations 
obtaining in the particular case. 


4. (a) (i) Rational or merely visual memory of construction. 


(ii) Connecting equality of lines with equality of triangles, of which 
the lines are sides. 


(iii) Memory of result of a particular theorem (on right-angled triangles). 


(b) (iv) Power of keeping essential and general relation of above theorem in 
mind, and of looking at the present case through the medium of it. 


(v) Memory of theorem on angle in a semi-circle. 
(vi) Power of seeing that the relation is a general one (i.e. power of 


noting from just what facts or relations the conclusion follows, and of 
realising that the sequence is not influenced by other facts). 
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APPENDIX IL. 


Evaluation of r 
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Correlation Table. 























Geom.” 
Arith. 
Geometry. 
ele i|lgrkes 3 > | »% a | 
™~ oa) ; S ~~ >) S RX a) 
} ~ ~ ~ 
Te iS | | | | ] | Totals 
| it ee oe oe ee oe 
a ae ee 5 ~ 
a} Owe = 18 8 a ae eee Re fae ae 4 
5 i iG | 76 1 BS 2 Stee — | 1 Aa ay) aa 15°5 
S| 30— 45] 1 T ee be oee: |ode 1 | — _ — | 235 | 
=| -—6o} —|2 |7 |5 | 75 |] —] 1 = | - | 29-5 | 
| 60— 7a 4 2 es i = 2 So — | 1 55 | 
a 75— 905 — Hee Sal bead 6s 2 — | l= 4°5 
i cn dw Cree he 15 | 1 : ee 55 
105—120 | — Se pe = pie di a3 | a 
| 120-135) — | — | — | — }—- | 1 | }—-|— Fi | 
| | | ] 
Totals 2°5 | 175 | a1 | 195] 105] 7 7 | 2 1s 83 
B. Evaluation of » for Arithmetic and Order of Form. 
Form (). 
U.vB | U.VA | LVI | U.VI VII Totals 
} 
| ae Ss Sa i Viet a 
é— 35 2 > =e = = 4 
15— 30 8 2 3°5 2 — 15°5 
I I I I 
30— 45 5 7 4°5 7 = 23°5 —Arbitrary mean 
° 6 oS e from which true 
means and stan- 
45— 60 3 4 5 4 6°5 22°5 dard deviations 
rN 1 I : : e are calculated. 
=> 
© 60— 75 a 2 l 1 1°5 55 
‘S 2 2 2 2 
2 
a) 75— 90 1 05 3 AD 
— 
“= 3 3 
< J 
90—105 ~ = — 15 4 55 
4 4 
105—120 - 1 —— 1 
5 
120 —135 1 1 
6 | 
nm 7 / 7 
Totals (x,) 18 15 | 17 18 15 83 (1) 
= 
Means : y Z = y= ‘68 
= . “$ 2°36 4 $e 
(yz) 0 40 15 1°36 30 o,2= 2°65 
Pe oe i 
= ye!’ = 3577, 
7 N oy” 2 
*e n='59. 


Biometrika vu 





























































































































rE | 98 | 6& | TE] for | OT | Fee | 9¢ | Fee | 09 | ae ae —— yim SR tl ee APRS ee | 
go LEY Re | 96] Lo | 82 | oc] cs | st az |o | tc | o¢ | 468 | 22 | oc} ec lsr] os| oe | ae] SS | 2%] Ort | ts forst) ern 
o¢| trl oe | deh ez io Fol ccd top| Sod So Ot] of | se] eri ce] ter | ee] tec] a | oo | be [ane | fe [toc] oo |g (Ror) SSS] % 
= sabi vi r9 1 6c1 ze} oel ze | sel zel tel 22! te I ¥eo! ge 19 | os | & | orzt! § x b 
8st | 6¢ » 42 | se] for | or Jf 6F | 89 | for | 99 FJ Ge | oc J ¥1z | o9 J Le | of | 92 | of | _8z | 6] os | To p fot | oe ez | sot | 2% | 9-9T Be 2) | 
| 96] cb | fez | TF |] for | or | 61 | 42] fet | szf 6¢ | orf TZ / 0c] f29 | oc | zt | of | Ftz | er] et) 22 | fee} ge | ge] eet | 2g | O1-FT| (Na ‘dT 
| Of | GZ GF | 66 ¥9¢ g 99 | 8S | Fo | 8F 6¢ | OT rE | OS LE | 0€ 9% | O€ | THF | C% | 169 Sz 0€ c¢ T6F Sol TOT | €-LT ‘W'OC'L | OF 
| $e¢ | st | 61 | FF | for | ot | fee | G2 | F2t | ch] Fe oT] FL) 2t4 4e| og] 92 OF | f1F | Sa] OT) 89 ct | PF ce | set | 6 | FLT ‘Nd ‘a 
% - | - P 
62 | ce | ce | ce] 32 | 0 | fe | 69 | f29 | ee | 8% og | st | oF] 28 | of | fet | ee | 6e | 6 19 | 1¢ | fer | ¥F T¢ | FOr | 9T | 9-91 1’) 
1% | ce} for | og | 2¢/ 6 | f4e | 89] foe | so] ot o | fz¢ | og] 9 | oF] 2h | OZ] Fee | ce] fre | sr | fee | ge OF | SIT | _€& | IT-LT I'M 'S | 
| ¥6¢ | st | fon | OT] 22 | 0 | fe | oo] #2) te] 6¢ or] sr | oF] 2e| og] 24h | OZ] feo | IT] F9/| OF | feo | se | fro | 92 | fI¢ | @St| “A 'H'S ‘A 
| eb] te | ¥e¢ | 12] 62) % | 9 | oh] oz | ee] fer oF | f2e | o¢ | f29 | oc] 2h Of] fs¢ | et] es) IT 1e | 89 €9 | LL 6 | 0-91 ‘Od 'f| 98 
S 69 | or | 1¢ | 32 | for | or | fee | 69 | FoF | Fo] cL oO | fs2 | or] F249 | Of] 22 9 fse | 2 | ft¢ | sé | 08] 9T | 2] T9 LL | 0-FT | ‘a‘d| TAT 
| ‘ony Were. L OD Pe NS ee ewe OSS alone Te 
TE sae ae, ee ee Pee . | . 
-> | 1F/ ze] re | ee] for | or] t9| FF] ee | oF] 6e | oz | fz¢ | o¢ | f29 | oc] 08 | t | ft | es] foe | pe | se | se | ee | 06 | fez | &9T) “MPM! 
~> | | 
= | 6t/ se} 92] OF] 61 | 02 | ftF | G9 | F6t | ee fF | or | 92 | c¢ ] 24e | 0¢] 8 | OFF f9z | OF] 22! 8g 8I | 99 1 | #90 | GL] TFT "MS 'N 
L | | 
2 Lt| 09} 9 | 09] for | or] 11 | 88] fe | sot] 82 | og] for} oz] 9 | OFF 8 | OF] ¢ - ¢ 6 -€% | 09 § | 1% | 93| &9T] ‘Md ‘DO 
~ #8 | 9¢ | ¥er | 83] 3210 | foL| Se] 2h) ee] ge 0 TL | 03 | 24 | of | $69 | OT] foe | 9¢ | Be] ee | fs2| LT | Le) 98 06 | *-9T; ‘L‘V‘H| 08 
ae fet | on | og | 98] 32] 0 | foe | OL} 6e | 4¢] 6¢ | or] Fe] 0g] 4e | O€ | 2h | OZ] 02 | ce] eo] Lo | f9F| OF | foe | HT | 29 | 8-FT s #| 
S | % | we] ee | $8 f¢ | OF] T | Set] T | sti fot | oF | fer | co] Le ) oe] 8 | or], fia | ery t | sory 2 | 88 | & | #9 | foe | 8ST ‘SNH | 
‘> ld nd | 
3 | 9¢ | oc} ¢2}8 | 61 | 02 | fe | FLY 22 | OL} 69 | or] 8] 2 18 | OT] 28/0 | f9| 11] 8h] OF | f8L | LT 149 | 89 | fst | $-ST ‘SW 'O | 
= Le | ¢¢ |] ge | oc | 22 | ct | fF | 99] 82 | t9] fe | o¢ | For | oz} Le | of | 2h | Of | fer | ta} fre | st | ft] 32 | se} it | fzt | ¢-9T "SL °f | 
S | % | se] | 29 | for | or | fre | 89 | foc | co | F1c | ce | zt | coy ze] os] st) te] 9 | ez fcr) SF ez | 09 | f4T | 9LT J ee] 6-ST ‘L‘O'f | $8 
Ss 82 | eF | sz | 48 | for | OT] foe | OL | eh | co] 6¢| oc] sa} Fe} 2e | OF | 92 | 0€ | Foe | 9¢ | 68 | OF €z | 09 | foe | arr | fet | I1-9T NO ‘OD 
S 91 | 99 | fet | co} St | 4a] feo | FLY Of | og | FIT | ae] 2 | os] 09/ oa] 8 | OF] fee | ce] 49] 22 | F | 96 | F2| SST | SL] FT ‘I 'a ‘a 
f1z | 9¢ | 12 | eh] 32] 0 | f99 | OF} 92)| cel sr | zt] itr | tH] F249 | oc] 9¢ | 90] fet | co | f69 | ce | sh} 6e | se) LIT | fet | TT-9T ‘H ‘VW 
Se §9 | pe] 6t| FFE 22/0 | 68] 99] Of | o¢ | 68] oc] Te | ce] £249 | of] $9 | cL] ot | cof fst | TH | fee] ge re | 98T | ee | 6-ST “I'M 
S | tee} op} ot | og} f6 | ce] F | OI] 2 | 6st ts | sey 2 | os] for | se] zt) ce] 21/6 sr} TL | 9 | 68 | @t| 60G | f4T | ¢-9T ‘ALO | 08 
> | 8 | es | for | oF | for | or] 9 | eh | f09 | re] ot | ce | fet | cof 9 | OF] fre | 8] ot | te] fre | ce | oa] 6¢ | Fee | ott | foe} 8-1 ‘a ‘Vv ‘H | 
= @ |s6] t | 62] IL | og] ¢ | ZI & | cot] 6e | og | FIZ] 09] 29 | ce] ice | 62] T | 66] @ ZIT] Of} ge @ | 993 | LL] O-FT ‘a ‘a0 
= | ar | ahi fe | so] te} 2] 2 | te 9 | e6 | fz | oF] Fe} oc] ze | oe] 92 | oF | fr | ec] 8 06 | 248] 9¢ €1 | 66T | OL | F-FT ‘v‘a‘o 
| $12 | 9¢ | 6t | FF | FOF | oT | f09 | 8h | Fco | ee] G2} 0 | GF] ce] Le) og] LF | 02 | f9z | oF] fro | ge | is9] 22 ze | sor | fet | ¢-ST ‘V'W'S| ‘IA’ 
Sy ae aa) 7 ee OS See eee ey eee se SS San eae | wa eee Sere Satie 2 . 
= Re ae ie Pg Rn ot a 
3 | 6 | LL} 6 | $8] 9% | 9T] 92 | TL] #6) 49] 82 | o¢ | FE |] og} 48] OF | 92 | OFF eT | 9¢] GE] OF et | tL | f4t | get | fF | OT-LT ‘L‘O°V| $1 
° | fez | oF | fze | ce | fea | ct | fr | oo | fet | 227 8c | og] fte | oo} Le | og] Le Oc] Fe] IF] Of) 6 | Fe | Fo | 96) FET | ge | ITF ‘Sf H 
= | tT | oom @ | FL, #8 | orf ¢ | STI] 8 | sey F | sete je Le | oe] FI 6¢q @ | c6] TI b g TIT} T | #86 | 06] ¥-9T oF) | 
st | 69] ir} eo] fe | on} 2 | 90 6 | 98] oF | ST] OT} eo] 9 | OF] 8 | OFF Gt] Le] 2 1 | 61] #9 | fot | ata | fee | -CT | UUM ‘a 
Ss 92 | ch |] foe | ce | 2 | ze] foc | SL] of] ss] oc | ze} ¢ | 9s 9 | oF] fot se] et | zz] or] os | fot} 08 | +1 j| Let | foe | 8-7 ‘d‘H ‘u | 
=| ot} ext fz | 9¢ 4 ¥6 | ce] 9 | sot] fe | tor] 8c | og | et | 69 | far | 6e | fet | se] 8 | 99] F | 96 | for] O8 | ¥9 | ate | OF | FST ‘W'S'S OF 
TI | e2 | for | oF | fe | oF | fos | 92 | f2t | e2 | 8e | og | For | oz] so | sa] 9c) 0g] ¢ | 6 | Ls | 6 18 | #9 | cto | & | 9-8T ‘I'D 'V 
| 
Lt | 12 | ez | cb] 6T | of | 8 | 86 | for | o¢ | fet | oF | Fz | o9 | Le | og | 92 | of | Fe] IF] st] 09 92 | 89 ez | 6ST | SL] & FT ‘H ‘d‘H 
6¢ | ee | for | of | fee | ¢ | fot | OS foe | se] ¢ | 09} 2 | os] 9 | OF] 92 | 08 | Fes | ce] Foz] eg G e6 | ST | SLT | 62 | 0-91 ‘Ad “1 
© 02 | 4¢] ee | Se] _6t | oc} et | 48] fea | cof gz og | #12] 097 9 | oF] LF | 0c] st | st] 62] ae 4t| 89 | fet | sot | for | ¢&21) fa ‘Vv H 
= 92 | ch | fee | 1H] HL | 2g] 6 | 16] ¢ | tO Tt | GLY & | oct] 2e | oe] T | og 62] ze] ¢ *6 z cet] ¢ | 99% | f9¢ | 8-ST ‘a°3x'0/|9 
€ | ss} rt] eo] st | %] 1 | #84 fee | 09] «@ | sz 1 | ze 9 | oF] & | SF] OT] 29] of] 69 T ser} ¢ | e¢e | feo | 6-51 ‘OMAN 
> | 48} _¢ | OL} fet | c2 | foe | OL et | os | ft | cee F | ce] 6¢ | 24a] 8 | OF] ot] eo] st] so | 8 | gs | fot | ete | ge | 11-41 ‘O° “A 
¢ | os] fe | co} fe | oF] 11 | 8s] fet | 22] co | 6 | te | HH] Le | oe] 09/ st] F | 62] 9 | 86 | fee | SF | 6 | Os | fr | o9T| “a'r N'V 
fet | ob | 11 | eo] 2 | Lt] fs¢ | 19 | fer | 2] og | ct ith | HF] ze | og] 8 | OF] FG | IF] se! Gh | Of | ge | fea | cet | fee) ¢-9T) ‘V'V'H| 7 TA) 
| | | 
ee oe Say SS aaa wes a ae ar 
H Ji2y) 9 eu a amu a peer ad peru) O Pea, a Peed Vv piceg ‘Srypeey| “QTV wits aie. yuRy mer yuey| ey | owen WLIO 


























“SUN pup sysop “TIT XIGNUddV 








‘O ‘I xtpueddy sag = "1/8 48 syavur Aue urezqo 0} poTIey 10 ‘OT *ZIA ‘YIBUL TAINUITxBUT oy} pauTed rayy10 Loy} sesvo SOUL UT 4vq} TONS 10M 
poyreur satyroedvo-j18d oy} Arjoutoex) UI 4B} OR OY} OJ ONP SI SINT, ‘q pus ‘o‘g ‘¥ sUUINTOO Ur ‘su94 07 A[TeIOedsa ‘saIndy UaAe 10; SYAVUT JO sBIq yvoI3 A[ZUTUIE0S OY} OJON , 
*(Baqo3ty) uieq} Burdyddv jo somod puv sorna jo Atowew [ereueL) “Tt ‘wiqesty ur ovanooy "FL *(‘qy11y) UoMs0doad pus o8vqueor0d 
UI suIns Sulop jo aMOg “4 *(‘yyIzy) Urey} Surd{dde jo reMod puv saya Jo A1oww [eIeUeH *T ‘orjommyyiay ur Aowmmooy “oT *(‘u10a4)) osvo IB[NOTJAVd B UT SMOT}BTA.L 
[et0ue3 ZulstaZover Jo 1oMod puv ‘yoord ut Ay1[/e19U0F Jo AyIssooou Jo UONIUZODNxT “CT *(‘u10ex)) way} SurA{ddv jo zomod pus suoitsodoid Zurpeoerd yo Aromepy “C—O *(*uI008} 
suoyonrsu00 jo Atom,“ *£you0ex) ut (uortsodsedns jo ofdroutad *3'9) satdtoutzd yeroue3 puv suoiytugep jo A1oweu oj syrvpy *y == ‘shuspvay-saj3a] fo uonvunjda gy 


es | 



































































































































4 " 
¢9 | 9T | 19 | St | for | OT} 9F | O9fF 82) FO] GL | o | fez | or] 1s | or] fez | 8 | isc] er] foc es | fon) st | se] te os |Inet| ‘aH ‘d| g9 
1g | #2] 99 | ST | for | OT] _6F | 8¢ | 82 | F9] 8c | 08 | fF | FF] Le | OF | #69 | OT] 9¢ | oT] 8F| OF | FE] Fo | FF! OTT | 28| 6BET|"M’H ‘A ‘A 
ho 18|0 | f0L| OL} G2 | 0 | f0c | 92 | fue | 9%] oL|0 | fs | O | 18/ OT] 0 | f49| 2 | fer, TH | e8)| 8 FL | 9¢ L | bLT| ‘M'O°H 
(Je) | | | | ss 
of foo | st] fez | 6 | for | or] for | se] 99 | Ge] cz) o | fo9| ea] ze | of] 09 st] feo} 2 | 2 Of | for 9% | for) eo | 29| 941) “MH 'V| 08 
18/0 | f18/0 | 22/0 | 92| 82] 08 | He] Fe 92] f99 | Ga} sz) st] f69 or] f1g/o0 | 18) et | 29 ze | fos | oF G¢ | IL-FT ‘NV 
fee | 16 | f49 | Ir | for | OL] GF | se] as | OL] Ge | oc] fle | og] cL GT] Fes 8 | zi ¢ | | Fo | Fee ee | e¢| co | 49) 991) “Ia 'D'a =a 
18|0 | fts}/o | | o | fs) 8 | e8| or] oe | 0 | te | of] 2e oe] f69 or] fis}o | #8) 9 | ies) % | eB] O8 sg) TST ‘I'd ‘Vv 14 
| 
| ag | | a Aa 
fon |b | 62) ¢ | for | Or] 18 | ST] 82 | Le] SL} 0 | f4¢| og | Le | o¢ | fe9 | er] 22) ¢ | 2 | 02 | feo 9 | Foz) e¢ | fea) gor "I'd = 
18|0 | f18| 0 | #8 | et] 62) sa} o¢ | oc] oF | st 2} ett en! or} st ¢ | #18!o0 | S19 Te 29 | 3&8 aL | 9 OL | F-FT ‘HL ‘f£ | 92 
18/0 | fts|o0 | 2/0 | fo9| st! 29 | Le] 6¢ | or] fst | or] 8 | ot] f69 or] fts|/o | 22) St | 18) st | o8| se OL | %FT| ‘H'L‘D°A 
e9 | LT] fo. | or} 32/0 | foe | OL] 69| ct] ¢t\o | fe | og} Le | 08 | f69 | or | FFL | F o¢ | 6& | fet | ¥@ 89 | 49 | fF9 | 2-41 ‘)'O'U 
| _ a | D | | 
GL | L tL | 9 0 | fo. | se] f69 | 9¢ | cs | o | f2¢ | og | F9¢ | oz] ze oc | F92 | ¢ e¢ | 98 | fs¢| es 99 | Sb ZF | 9ST ‘) ‘H'I 
0L| 6 | fe2 | 6 0 | 02 | se] fet | ce] ec | er] fee} o | ct) ot] 62 2 | 2) F | 9 | 6 | e8| at #08 | oF 66 | ost) =p 07 
149 | 81 | 99 | &T 0 | 92/8] 242) 8c] 6¢ | or] 8 | Of | 9 | OF] LF Oz | f09 | Zr} TL| #% | ch | 9F | 09] 28 €L | FT “T'O'V| 02 
f6¢ | st | F2¢ | LT 0¢ | foe | o2 | fet 82] ot | se] for | ox | fcr | oe} 8 | or] e¢| st] oc) 6¢ | 12| 19 | e¢| set] co| 8tt) ‘a’M‘H 
| | 
fon | & | fo. | OT @ | FL | se] 69 | 9¢ | 6¢ | or} TL | oz | F219 | 0z | ¥69 «oT | fst | z #L | 06 | fon | 9% 6L| 8h | fs | ST O71 'V 
, 9¢ | 08 | fF9 | FT ot} ics | 8 | 08 | se] 2) 0 | fe | og] ze! oe] fe9 orl c9|8 | oz | zt | zz} ge | | o¢ | 08 | tLer ‘d ‘H 
S 1L|8 | f2¢ | LT 0 | 99 | oF] f69 | 98] 2/0 | 29 | 8c] ze] oe] 09 at] fo] 2 | 62! ot | zo | ee | oh | g¢ €8 | FET | ‘a 2°" | AO 
: ee ——}-}_+_ |} | — 
& & Zz L 
o fst | 9% | 19 | ST LE] foe | OL} o¢ | e9 | Ge | oc] f12 | 09 | fet | se | Fee oc] e¢| st] _te| o¢ | oz] co | oe] oct | 9 62t| “Ma ‘H| o9 
; fo | 6 | $19 | ST & | foe | OL] of og} 6 | oF] f9| e¢] 9 | oF] 8 | or] 02/9 | fre} se J or] Tt | oF) ott | foo | ost) ‘MT ‘a 
= fer | 96 | $19 | eT SG] ST | G8] 1 | 29} 68 | oc] SF | OFF Le | Of | Foo | Gr] So) zt] FF | ee | fee | 8h | BF} LOT | FFF | ¢-eT| ‘dA ‘dN 
Lg | $8 | Fze | ae ¢ | 66 | 8¢] fF9 | oF] co | et] fst | or] 18| or] zs o | 6¢ | 62] f19 | Te | 9b) St | 69} 82 SI | LT ‘ape 
feo | 12] SF | SZ 0 | feo | o¢ | feo | 26] fot | oF | Fte | 09] 9 | oF] 8 | oF] fr | cz] _6¢| ce | re| to | ee] itt | fee | eet) ‘N“L‘O 
84/8 | 22/9 OL] 29 | 4H] Sh | sot 6e | OT] fL¢ | og | ¥z9 | oc | 24! oc | ¥92 | ¢ | fre] 8h | fer] se | 9g] 68 T€ | 1L-ST| ‘Nd “] 09 
Fe | 98 | fer | oe OG] FF / 9] SF | oo] sc| og] fiz} 09] t9| ec] 24e oc] 2 | 69} f92/ €¢ | og | ge | oT] zat] 29/ oft! ‘NO 
Tr | ce | FoF | 9 /§ | 98 | eS] we) 19} o2/0 | 92) sty gL) Gry LF oc] oF] He] TH] FF YT gL | ez | F¢ | 16 at L9T | "N 'N 
§1e | 6e | ee | ge for Olf 08 | Sf 08 FEF cL!) 0 | ite | og] ze loge} feo et] 6¢ | 6c] 92] 6t | fe9| sz | 9] 92 08 | IT-&T | I ‘ea 
2) ¢ | 29) Ir] Ot | os} St | GL] If Is] 12 | rr] fF se] for! ce] 92 og] t9/6 | 21) 29 | to) ze | for] sor] 22) OFF) “T'N‘H 
89 | LI} 6¢| 9T] for Or} It | ss] foe so] e6¢ | or] fr sey ze | og] fre sz} 2g | et] Foz) e¢ | for) oF | for | sot | fre | @er| "ek! 99 
48) PE] 08 | 96] Te | OF] Gs] LE] FoF | Fo] GE Oc] Fe Og] Le | Og] 2h oc] 6T |} OF] FT) 69 | Le) eg | fT] sot | fee | L-eT ‘HOV | 
| | 
99 | FIT] st} oat ato 9¢ | $9] 0g | og | 82 | og | Fe oc | Ft | se] s | or | fF | ea] foe | Fe ue | 8g +P} or | 22 | T-9T| ‘a I‘a 
89 | 8 tr9 FI] tO | fo. | sey fcr | ce} sz | og] fe! oo] ze) oe] zr 02 cL |¢ | #19| 18 | TF) LF 6¢ | &8 69 | 8-91) ‘V'N ‘OH 
9¢ | 03 | fe¢ | IZ] SL O 6F | 8¢ | Fe¢ | Le] fet | or} 89 | eey Le | og | foe | 6t | foo | at} 89 92 | fs9 | 22 | fon | 9 | 9) LT) ‘VOM /v'A‘'O 
th 2 | CH eo) ES ae | — Fee: oe ae) a ee eee 
ee | eo] sr] eo] 2) 0 | for os] tse | oF] Frz | ce] 62 cof ze og] se! ec] se | ee] er] ep or | Lb 42 | sot | 8 | GLT| ‘“M‘L‘M| 09 
SF | 62] fer | 82] eo 8 | cr 9] fos se] fe joc] re og] ot et] 09 st] _t¢ | oc] st) oF | og) ¢¢ | oF| ett | aF| 9T| "MO | 
oF | 08 | fee | ¢ | for | or | fo. | se] fro | oF] oF | st] 6e sr] ze | og] 92 og | itr | sc] ire | sp | fer) Fe | ze | oat | oz *91| ‘L‘H'S 
fie | 6¢ | To | ce] for | or | foe | oz] ae ont 6 | OL] TL OG] ¥29 | OF] 09 st} fer | Ta] 6e| oF | 16 | se | fer | gor | s¢| 6ST) ‘“L‘D'a | 
ve | 98] 6 | TE] TOF | OT} tee | 99 | tee | 09] gL |o | 2 | Oc] f49| os] #26 | Fee | ce] ze!) Le | feo | 8a | se | ort | f4¢ ort) 86'S T'S 
€9 | LT] FoF | 92 Ll | 0 8L | 9% 68 | 8ST cL io §x¢ | og | f9¢ | 62 LE | 0G e¢ | 8ST 08 Lai 9g rE 69 | 99 T | OT-8T ‘Sul 'S | oF 
fon | LL | 9 aL | oO | fs¢ | 1¢ | f09 | I 6¢ | OT} o¢ | set st} ze] fet | ee | Fez | z 99 | 98¢ | feo | se | fo. | go | fee | ¢-9T ‘d “1H 
o€ | IF o€ | 98 ZL iu 9L | 82 1 ¥69 | 98 99 |Z *9 | 64 Le | 0&8 LE | ¥% ze | Té ZL | T% Feg 83 19 08 $e | OT-2T! “dH °O 
; Z| & sr | 9-9T 
| eo | for | 09 1 6g | oc | 12] 09] ze | oe | 9¢| og | _sz|6ef of| to pfrt| se | ee| cor | er | 9-9T 





eel Ene | arl er 


> 
> 
= 
~ 
N 








DARWINISM, BIOMETRY AND SOME 
RECENT BIOLOGY. I. 


By KARL PEARSON, F.R.S. 


(1) THE principle of the “survival of the fitter” as the basis of progressive 
evolution depends upon the co-existence of three factors, (i) the destruction, or 
elimination as far as reproduction is concerned, of those less fitted to their environ- 
ment, (ii) the inheritance of the somatic characters on which the fitness depends, 
and (iii) the absence of any differential fertility. These essential principles of 
natural selection are frequently overlooked not only by sociologists but by biologists, 
and whenever they are disregarded, there an inroad of a serious nature has been 
made into the Darwinian theory of evolution. It is possible to illustrate this point 
in a variety of ways. Thus Nature in the struggle for existence in wild life, regards 
and can regard only somatic characters. A somatic character which is not inherited 
in a greater or lesser degree has no value in the Darwinian theory of evolution. 
Again, a genetic constitution which does not manifest itself in somatic characters, 
i.e. is not directly or indirectly correlated with some one or other somatic character, 
cannot form the subject of Darwinian evolution. To what extent are patent 
qualities inherited? This must always be the fundamental problem of any 
Darwinian interpretation of living forms. And—above all in populations where 
selective mating is small—this problem must and will remain in the first place 
a statistical problem. The knowledge of an universal physiological law of heredity 
would immensely aid artificial breeding ; it would essentially aid civilized man in 
the generations to come, when his knowledge dictates and controls his emotional 
needs*, But from the pure standpoint of Darwinian evolution, the fundamental 
problem remains the intensity of the transmission of somatic characters in the 


* There is not the least doubt that good physique generally and that mental capacity in many 
cases, can in the present state of human development have sexual attraction; deformity and imbecility 
produce sexual repulsion. With greater and more widely spread knowledge of genetics, the gametic 
constitution of an individual as roughly manifested in the average somatic characters of other members 
of the stock may well come to be a potent emotion-controlling factor of a practically unconscious kind. 
Even to-day many human beings would, as far as matters of sex are concerned, experience revulsion 
rather than attraction from an individual whose relatives were physically, mentally or socially defec- 
tive. There is no doubt that this emotional state will be emphasised the more we are trained from 
childhood to give the proper relative weights to environment and heredity. 
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general population of any species. Those who desire to test the efficiency of 
Darwinian theory must examine nature in the first place from this standpoint. 


(2) Again, recent work on-evolution seems to have very largely disregarded 
the direct possibility of testing in wild life, or under artificial conditions, the 
destruction of the less fit. The correlations between death-rate and environment, 
and between somatic characters and death-rate, are essentially problems which can 
be treated by biometric methods, and which are vital to further progress in our 
study of evolution. It is true that human vital statistics present an immense 
material for the relationship of death-rate to environment; but they have not 
hitherto provided the much needed measures of the somatic characters of the 
selected. What is much needed is a widening of the breeder’s outlook,—that he 
should not only find what is inherited, but that the age at death and the fertility of 
his stud should be recorded with special relation to the somatic characters of its 
members, and to marked differences in environment. Light, warmth, food are all 
in a high degree within his power of differentiation, and his greatest difficulty, 
environmental growth changes, ought to be surmountable by the use of standard 
populations. For the purpose of evolutionary study, once the intensity of heredity 
has been determined, the physiological process of heredity, important as it is 
for other branches of science, is secondary; the next important step is the 
correlation of death-rate with somatic characters. I venture to think that more 


experimental work than has been recently attempted might well be under- 
taken in this direction. 


(3) Lastly I would refer to the third important factor of evolution—the 
absence of a differential fertility. This point, it seems to me, we again and again 
overlook. It is extremely hard to believe that fertility could by any mechanism 
come to be highly correlated with the peculiar type of character fittest to survive 
in a given environment. The only easy way to suppose it lies in the case of those 
species in which length of life is at once a measure of fitness and a measure of 
fertility. We cannot apply this consideration, however, to the case of species 
which breed only once. If on the other hand fertility were inherited, and were 
at the same time correlated with other characters, it is difficult to believe thai 
a selective death-rate could perform its functions, except in the case where the 
maximum of fertility coincided with the optimum of the selected characters. 
Personally, the truth that natural selection presupposes the practical absence of 
genetic selection, has only gradually been forced upon me by the discovery of case 
after case in which there was little or no demonstrable inheritance of fertility. 
I began with the conception that fertility would be found to be a markedly 
hereditary character ; I expected to find genetic selection masking or even reversing 
natural selection. Then, as very small or even insensible values came out for man, 
horse, swine and mice, I have been forced to the conclusion that the smallness of 
the hereditary factor in fertility is an essential feature of Darwinian evolution. 
I should be surprised to find a large inheritance of fertility now, just as I should 
be surprised not to find a large inheritance of any somatic character. And on the 
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evidence we have at present, I should not expect to find a correlation between 
fertility and a somatic character; if it does exist in any case, I should anticipate 
very rapid changes going on in that species. 

(4) One of the most recent investigations in this direction is that of Raymond 
Pearl on the inheritance of fecundity in Barred Plymouth Rocks. His paper on 
the number of eggs of this race is an extremely interesting one. It is true that 
he only gives the crude correlations and does not give the data which would be 
necessary for their correction. Thus he had 31 mothers* and 180 daughters, and 
some of his “arrays” were probably due to a single mother bird, but this cannot 
be ascertained from his Tables. In working at fertility in man I found exceedingly 
small values, until correction was made for weighting of the more fertile mother, 
and then the values appeared to be small but appreciable. If we put aside, how- 
ever, the small number of mothers used by Pearl and this question of correction, 
there cannot be the least doubt that Pearl’s data show a low, if not zero, intensity 
of heredity in fertility, thus confirming in poultry what we already know of many 
mammals, i.e. the absence of inheritance in the case of fertility which seems 
a necessary foundation of the idea of Darwinian evolution. 

Pearl does not take this zero correlation as confirmation of existing results for 
fertility. He considers it in sume way a confirmation of the pure line theory of 
Johannsen. We are sorry that we cannot in the least agree with him on this point. 
For if his view were correct, every high correlation would tell against such a theory, 
which it certainly does not; and in the case of mammals, where the fertility 
correlations are zero and the body characters highly correlated between parents 
and offspring, one correlation would refute, another confirm the pure line theory 
in the same species. Nor again can data taken from measurements of a character 
highly influenced by environment be in the least conclusive, if they are handled 
in the manner recently adopted by Jennings, who selected a few individuals with 
large values of a character and compared their offspring with the general population. 
There is no security whatever that the individuals selected did not have excess of 
character owing to environmental and not to hereditary conditions. Such sweeping 
and dogmatic statements as those made by Jennings that: “The ‘standard devia- 
tion’ and ‘ coefficient of variation’ express in a pure race mere temporary conditions 
of no consequence in heredity” merely beg the question of what a pure race may 
be, or else show a sad ignorance of what work has been done on this very point. 

The only manner in which it appears to me that an answer can be found to 
the question which Pearl and Jennings take as now settled—that is to say, the 
problem of whether or no selection within the individuals descended from a single 
unit can or cannot produce an effect—is to correlate ancestry with offspring in 
species in which self-fertilisation or parthenogenesis takes place. If this be done, 
then the relation of offspring to parent ought to be identical with the relation of 


* Nothing short of 500 mothers would really give a convincing result, I think, where the actual 
correlation is so extremely small. See Pearl, R. and Surface, F. M.: ‘‘Data on the Inheritance 
of Fecundity obtained from the Records of Egg Production of the Daughters of ‘200-Egg Hens’.” 
Maine Agricultural Experiment Station, Bulletin, No. 166, 1909. 
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offspring to grandparent. Warren and Johannsen himself* are the only investi- 
gators who, so far as I am aware, have hitherto provided material in sufficient mass 
for approximately answering such a question, and what do we find: 

Johannsen: Phaseolus vulgaris. 


Correlation Regression 


Offspring and parent... ot 35+ °01 27 
Offspring and grandparent ~ 244-01 ‘11 
Warren: Daphnia. 

Offspring and parent... — “47 + 05 "62 
Offspring and grandparent am 27 +12 50 
Warren: Hyalopterus. 

Offspring and parent... ous 40 + 03 “47 
Offspring and grandparent me 24404 ys 


In every one of these cases, although, as the probable errors show, not based 
on sufficient numbers to be absolutely final, the parental is higher than the grand- 
parental correlation. The ratio of the two is almost exactly the 3/5 found for 
the same ratio in mammals with bisexual generation. Pearl and Jennings put 
such results—and many others of the Biometric School—on one side, without even 
reference; on what bases? The former on the foundation that fertility is not 
inherited in hens—an interesting result but one that might be predicted from what 
we know of mice and swine. The latter because he has failed under the method 
adopted by him to discover inheritance in Paramecium. From that he argues 
not only to the whole range of possible heredity in Paramecium but apparently 
in every living form, whatever its mode of reproduction. The fundamental con- 
dition of a crucial experiment of this character must be to show (1) on a sample of 
sufficient size that the character is inherited at all, and (2) that if it be inherited 
the relationship between offspring and grandparent is as intense as between 
offspring and parent. Whether the pure line theory be true or not, it cannot be 
demonstrated either by showing that a fertility correlation is zero, or by the mere 
ipse dixit of Jennings that “‘ variations’ are mere temporary fluctuations, without 
effect in heredity, so that their relation to evolution is nil.” 

No biometrician asserts that every variation is of hereditary value; on the 
contrary he asserts the very opposite whenever he tables a correlation less than 
perfect in parthenogenetic reproduction. What he does assert is that on the 
average such variations form in a measurable degree the mark of a certain 
associated gametic character. He would not have thought it worth while to 
try and determine the correlation of gametic and somatic characterst had he 
believed every individual variation capable of direct inheritance. Nor would his 
results be in the least upset if—and some theorists in heredity seem coming to 
that—every individual were in himself a “pure line.” He might mildly wonder 


* Elise Hanel’s results for Hydra grisea will be discussed at length below; the character of the 
selections made prevents full comparison here. 
+ See Biometrika, Vol. v1., pp. 90, 91. 
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how all these pure lines came into existence in a single local race on any philo- 
sophical scheme of evolution; but it would not affect his treatment of inheritance 
in populations. Why then should he criticise the work of Johannsen, Hanel or 
Jennings on “pure lines”? Simply because the experiments made by these 
biologists are in his opinion insufficient to provide a basis for the sweeping 
inferences drawn from them, and further because they contradict other biological 
work, which he thinks just as important and valid. The assumption made by 
the pure linist is that in reproduction from a single individual the same gametic 
constitution is transferred to every offspring. The biometrician sees no sufficient 
justification of such a view in our present state of knowledge; two seeds from 
the same self-fertilised plant may differ in gametic constitution, and this differen- 
tiation may appear in the correlated somatic characters. Two Aphides from 
the same mother, according to Warren, do show such differentiation. Does or does 
not differentiation follow the division in Paramecia? We do not know, because 
no heredity of the “pure lines” has yet been shown to exist for the character 
selected by Jennings in the only way that would satisfy a statistician, i.e. a corre- 
lation table of the pure line parents with their offspring. On the other hand, 
having convinced himself that size of litter, for example, is not inherited to any 
appreciable extent in mice, but that colour and area of colouring are, he is not 
likely to accept the ccaclusion of Pearl, based on the non-inheritance of fertility 
in hens, that “the chief if not the entire function of selection in breeding is to 
isolate pure strains from a mixed population. It is found in actual experience 
impossible to bring about by selection improvement beyond a point already existing 
in the pure (isolated) strain at the beginning” (Joc. cit. p. 79). It is difficult 
to understand where in the pure strain at the beginning there existed the chief 
characteristics of any of our domestic animals of to-day. Such statements appear 
too sweeping in the light of any evidence at present available; they discard the 
Darwinian spirit of slow and patient inquiry, at the same time that they destroy 
the Darwinian theory of selective evolution*. 

That selection in a particular direction cannot go beyond certain limits is of 
course a commonplace of breeders, indeed it was illustrated at a very early stage 
by one of the first of biometricians, Sir Francis Galton himself, in his discussion on 
organic stability. The selection of any one character changes all correlated 
characters, and a condition is rapidly reached at which the organism is either 
incompatible with its environment, or possibly its germinal determinants become 
unstable and assume a new form of equilibrium. 

(5) While Jennings himself, as I have said, provides us with no data upon 
which we can test (i) whether the character he is dealing with is really inherited 
in Paramecia, (ii) whether in an unselected population the degree of resemblance 
of parent to offspring is no greater than that of grandparent—while in fact both 
Pearl and he appear to confuse non-heredity of a character with pure line heredity 


* Pearl, R. and Surface, F. M.: ‘Is there a cumulative effect of Selection?” Zeitschrift fiir 


induktive Abstammungs- und Vererbungslehre, Bd. u., 8. 257, 1909, answer a very big question from the 
same egg data. 
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—yet he has given his approval and sanction on two separate occasions* to a 
memoir by Elise Hanel+ on asexual propagation in Hydra grisea. We have 
thus a very fair test of what his judgment on a statistical proof of “pure line” 
heredity is worth. 

I propose to discuss at some length Hanel’s paper, because it seems to me 
that no definite conclusions as to “pure lines” can be obtained by investigations 
of this type, until the fundamental features of the research are modified and 
the observations treated with an adequate theoretical knowledge of statistics. 
Hanel begins with a very careful investigation of the growth and environmental 
changes in the character selected—the number of tentacles in Hydra grisea. There 
is general agreement with Parke’s results that the number of tentacles changes 
with age, size, food and place of culture. Differences in these factors can produce 
very considerable differences in individuals, and differences in the averages of diffe- 
rentially treated groups, which can amount to as much as 0°5 to 0°8 of a tentacle. 
These are precisely of the order of the average hereditary differences. Thus: 


| | 
PARENT | OFrFsPRING 


| 


Number of | Number of | Number of | Average of | 


Individuals | Tentacles | Individuals Tentacles 
| 


| 


| 
| 
| 
| 


| 

9 6 364 6-943 
| 9 7 | 310 7-296 

4 8 166 7344 | 
| 4 9 | 195 7383 


| 


* «*Heredity, Variation and Evolution in Protozoa” m., Proc. Amer. Phil. Soc. Vol. xiv. 1908, 
p. 521, Jennings definitely asserts that Johannsen in 1903 showed that in beans and barley selection 
within the pure line had no effect. He then proceeds to state that Hanel has found the same state of 
affairs—no selection within the pure line will produce an effect—in Hydra. In a second very dogmatic 
paper “ Heredity and Variation in the Simplest Organisms,” The American Naturalist, Vol. x11. 1909, 
p. 332, Jennings writes: ‘‘ Work with ‘pure lines’—where no intercrossing of races or individuals 
occurs—is possible with few organisms, and little of it has been done. In the few investigations 
carried on in this way, the same conditions have been found that we have set forth above for Para- 
mecium. They were first shown by Johannsen to hold for beans and barley, and later by Elise Hanel 
for Hydra. The fact that there exist diverse races, tending to breed true, has of course been shown for 
many species, but in most cases it is difficult to maintain pure lines, and thus to absolutely demonstrate 
the relations above set forth, as has been done for beans, barley, Hydra and Paramecium.” The italics 
are mine. He also cites Barber “‘On Heredity in certain Micro-Organisms,” The Kansas University 
Science Bulletin, tv. 1907, pp. 1—-48. Now several comments are needed here. Jennings directly 
overlooks the researches of the Biometric school, which was first to work in the field of pure lines, and 
actually found (see Warren’s papers, R. S. Proc. Vol. 65, 1899; Biometrika, Vol. 1. p. 128, 1901) 
that variations within the line were inherited. The same result appears to flow from Johannsen’s and 
Hanel’s own experiments, but the characters they have chosen are so influenced by environment 
that they have not observed the fact. It does not flow from Jennings’ or Barber’s experiments because 
they have not demonstrated that heredity exists at all for the characters selected. In fact Jennings 
repeatedly (and Hanel also) illustrates how markedly environment dominates the characters taken as a 
measure of heredity. Jennings then assumes that by selecting a character, the heredity of which he has 
never demonstrated, he can reach a general and ‘‘ absolute demonstration” of the truth of the theory of 
pure lines! 

+ ‘Vererbung bei ungeschlechtlicher Fortpflanzung von Hydra grisea,” Jenaische Zeitschrift, 
Bd. xxi. 1908, pp. 322—372. 


Biometrika vii 48 
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It will be at once recognised that the differences here are rather less than many 
of the environmental differences, and there is no security whatever that these 
26 foundation stock Hydra are really represented by differentiated hereditary 
numbers of tentacles. Yet this table as it stands embraces Hanel’s proof that 
number of tentacles is an hereditary character in the “pure line.” What evidence 
is there that any one of the numbers of tentacles attached to those 26 parents is 
truly constitutional and not environmental? In every generation that follows 
Hanel selects from each line two or three individuals to be parents of the following 
generation. In selecting a few isolated individuals in each generation, where non- 
hereditary influences are so influential, we may break the effect of heredity at 
each step, and since such influences are equally effective with heredity, the chances 
are that we shall do so once in every two selections. Only by taking large numbers 
of the high and large numbers of the low, would it have been possible to average 
out the effect of environmental changes. When the line is carried through two 
or three selected individuals only, there is no certainty that the rion-hereditary 
variations will average out. Of course this statement proceeds on the assumption 
that heredity is small in this case compared with other factors. Table I gives 


TABLE I. 
Number of Tentacles of Stem Parent and Offspring. 


Number of Tentacles of Stem Parent. 
; i ; | 








2 5|6]7 | 8 |9110| 11 | 12 Totals 
= | | | | | | 
See atone. Tete oes 
ai 6 24| 81| 54| 25/12| 1|/—| 2] 199 | 
Se 7 70 | 189 | 1386 | 75 | 36| 8|—j| ‘8] 472 | 
‘sm 8 23| 44| 93| 50| 22) 1|—|14] 247 | 
> 4| 4| 93] 14] 8] 6|—|—] 50 | 
g- | 0 J—|—]| 4] 1] 2] 3|—| a) 0 
so | i - es ee ee ee ee 1 
7, | 





Totals | 122 | 268 | 310 | 166 





80 | 19 | — | 25] 990 
Hanel’s data for the 26 original parents and their immediate offspring*. The 
statistical constants obtained are+: 


Parents (26) Offspring (990) Correlation Regression 
Mean ee ae 6996 7201 ‘230 "138 
Standard Deviation 1°435 862 +011 +°013 


* A protest must here be raised against the manner in which this author’s data are tabled. No proper 
description is given of the tables, and it is quite impossible to pick out the pedigree of any individual 
beyond the parent. Further, instead of the frequencies of offspring with each number of tentacles being 
given, the total number of offspring is stated and the percentages with each number of tentacles. A 
laborious process had therefore to be undertaken to replace Hanel percentages by the original frequez:cies. 
It was then found that many of this author’s percentages must be wrongly calculated, for their totals 
came to 111, 93, etc., instead of 100, while in other cases it was impossible to determine where one 
or more individuals should be placed on the basis of the percentages given. It is impossible to make 
anything of a return in which 13 individuals are said to occur in percentages of 15, 31,40 and 8! On 
this account 21 families had to be dropped, out of 245 for which data are provided. 

+ I have to thank most heartily Miss Ethel M. Elderton for the whole of the arithmetical reductions 
of Hanel’s data. 
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Now these results bring out several interesting points. It is doubtful whether 
the probable error of the correlation ought to be calculated on 990 cases and not 
rather on the number of parents, 26. Further, these parents are far from a random 
sample; the s.D. as compared with that of the offspring shows that the parents 
have been selected with markedly high and low numbers of tentacles. The 
regression has thus more meaning than the correlation*. We see then that 
tentacles in Hydra are really inherited, but with a value ‘138, only about a third 
to a fourth of the intensity of heredity in many other characters in other species— 
compare Warren’s values in the case of Daphnia and Hyalopterus. Let us see 
exactly what this means: it signifies that if we were to take a species in which 
heredity was as marked as in most characters in man, or as marked as in Daphnia, 
we should have to mix it with two to three times as many pairs in which there 
was no resemblance between the character in order to weaken down the measure 
of hereditary resemblance to that manifested by Hanel’s Hydra. In other words, 
a mixture in which one-third only exhibited normal hereditary values of the 
character and two-thirds characters having no hereditary significance, would 
produce the same intensity of heredity as is manifested in this materialt. What 
chance, we may ask, has any selection of two or three individuals at most out of 
each family of 20 to 40, got of exhibiting continuous selective influence ? 


To test the generality of the above results a Table was now formed (Table IT) 
giving the correlation between all available parents and their offspring. We find: 


Parents (224) Offspring (6436) Correlation Regression 
Mean ae a 7234 7275 "182 "135 
Standard Deviation 1°225 908 +008 + 006 


Thus we clearly see when we pass from the 26 foundation stock to the available 
224 parents, that the material has not been so markedly selected, but parents are 


* The regression would not be changed by the selection of parents, and would represent the correlation 
in a stable population. 

+ The low value of the correlation between parent and offspring is undoubtedly due in chief part to 
the variation with growth in the number of tentacles. This point has been demonstrated by Albrecht 
Hase in an interesting memoir (“ Ueber die deutschen Siisswasser-Polypen Hydra,” Archiv fiir Rassen- 
und Gesellschafts-Biologie, Jg. vit. S. 721). He shows that the average number of tentacles increases 
in Hydra grisea from the first day of separation as follows : 


1st Day 20th Day 50th Day End of Culture 
6°14 6-71 8°42 9°14 

It is clear that the hereditary factor must be obscured by such growth changes. But I do not think 
Hase is correct in his conclusion that the number of tentacles has no hereditary value; my analysis 
shows that the hereditary value is there, but much obscured. Hase holds that Hanel’s experiments 
have shown complete regression in the line, because she is dealing with a character which has no 
‘« Erblichkeitswert,” and that she has only thus obtained results in accordance with Johannsen’s. Iam 
entirely in agreement with him, that a first condition of any experimenting on ‘‘ pure line” material, 
is to show that the character chosen has “ Erblichkeitswert.” This both Pearl and Jennings have 
failed to do. But Hanel’s data do exhibit ‘“ Erblichkeitswert,” if only in a very attenuated degree. 
But to the same degree they show, what Hanel and Jennings imagine they do not show, that regression 
does take place within the pure line, and that properly instituted selection would produce effect within 
the limits of the pure line. There is a similar attenuated ‘‘ Erblichkeitswert” in Johannsen’s own 
material which shows in a similar manner to Hanel a partial regression within the pure line. 


48—2 
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still somewhat selected material as compared with the offspring. Taking, however, 
the regression coefficient as less influenced by such selection, we see that the 
intensity of heredity is essentially the same for all parents as for the foundation 
stock parents, i.e. 135 and ‘138. 

TABLE II. 


Number of Tentacles of Parent and Offspring. 


Number of Tentacles of Parent. 

















s | sj 6|7|s | 9 | 10 | 11 | 12 | Totals 
3) | 
$.| 6 1 | Samy Phe py ee, ye pl (Ps 1 
SP! ¢é 38 | 436) 440! 290 81| 20! 5| 8] 1318 | 
ee | y | 86 | 862) 875) 724 254) 81) 5 | 20] 2907 | 
a | 8 33 | 403| 501| 493 184| 67| 4| 20] 1705 | 
5 | 9g 4| 75| 114| 134, 62| 95) — | | 425 | 
Bo | 0 f-| - | je 6 te eae 50 | 
o| 11 —|/—} 2] 2) 9} 7T/—|— 30 
= | | | 
Z | Totals} 162 | 1776 | 1942 | 1666 | 603 | 213 | 14 | 60 | 6436 





We have now to consider the next stage, which presents considerable difficulty, 
for Hanel has not preserved all the offspring in order to reach a random sample of 
the second generation, but preserved two or three from each group only. We took 


TABLE III. 
Number of Tentacles of Stem Grandparent and Offspring. 


Number of Tentacles of Stem Grandparent. 








3 | 5 |6 | 7 | 8 | 9 | 10| 21 | 22] Totals 
5 | | ee 
= | 6 | 15|149| 157| 47] 28|14| —| 18] 428 | 
2-E y | 61 | 243 | 249 | 149] 92| 10 — | 26] 830 
S| 8 | 44/116| 121 | 124] 66| 6 — | 22] 499 
6 | 9 i} Se | SS) S| HB Ste 115 
- 5 10 Ee ee 4 2; 1 1}/—;— 8 
Bs | ne | — | = | — ey f= 4 | 
Z | Totals} 134 | 540 | 564 | 344 | 202 33 | — | 67 | 1884 | 








however all the “ grandchildren” of the original foundation stock with the following 
result : 
Grandparents (26) Offspring (1884) Correlation 


Mean = as 7199 7181 030 
Standard Deviation 1-288 884 +016 


Now on the pure line theory the resemblance of the offspring to the parent 
ought to be no greater than it is to the grandparent. It is here more than four 
times as great. Another very good test of the “pure line,” it occurred to me, 
might further be used to illustrate this point. If the theory of the pure line hold 
for Hydra, the relation of the offspring to the whole fraternity of which the parent 
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is a member should be as close as to the parent itself—either equally represents 
the character of the pure line. This method, as far as I can see, might be very 
profitably used to test pure line theories, because the parent will generally have 
been reared in the same environment as its co-fraternity*. Table 1V has then 


TABLE IV. 
Number of Tentacles of Member of Parents’ Fraternity+ and of 
Offspring of Parents. 


Number of Tentacles of Parents’ Brethren. 


j - = ——— 


| ARAB Se a 





4 | 3824| 7681 | 3143! 384] 70 
7 | 15! 6370 | 17869 | 7980 | 1587 

9 | 3322 | 10258 | 5919 | 1102 

9 6 | 733 | 2514 | 1235 | 338 

| 





10 }—| 36| 167| 93) 35) 6|—) 337 
i 5 91 | 4; 2] 8} — 154 


| Totals | 34 | 14290 | 38580 | 18415 | 3457 | 523 | 63 | 75262 | 





Number of Tentacles 
of Offspring 











been formed giving the relation between offspring and each member of the parents’ 
co-fraternity. We have: 


‘** Aunt or ‘* Nephew or Niece ” 
Uncle” (75362) Correlation Regression 
oe 7170 7213 091 "095 
Standard Deviation 816 851 +°002 +003 


It is clear that the correlation between parent and offspring is double what it is 
between the parents’ co-fraternity and the offspring, a result wholly inexplicable 
on the theory of the pure line. 


We can follow this idea further and remark that the degree of resemblance 
between every member of the pure line ought to be exactly the same. There 
ought to be no distinction at all between the correlation of individuals with their 
ascendants, their descendants and their collaterals in all grades. Now in dealing 
with man and with mammals generally, the biometric school has found the 
resemblance of brethren to be slightly greater than that of parent and offspring, 
parental correlations running from ‘4 to °5 and fraternal from °5 to °6. For 


* Ido not see further why it should not be applied to test ‘‘pure lines” in the case of bisexual repro- 
duction, where it is met at once by the fact that the relationship between uncle or aunt and nephew or 
niece has only about one half the intensity of that between parent and child; yet the parent and the 
uncle or aunt are both products of the same pair of ‘pure lines.” If it be said, which I am ready to 
admit, that the gametes of one individual are not all alike, that on bisexual reproduction the zygote will 
not always be the same, and that the somatic characters will be related to these selected gametic 
characters, the obvious reply is that there is no evidence at present that, either in budding or in 
parthenogenetic reproduction, there is always a transfer of precisely the same type of cells as basis for 
the new individual. 

+ Parent excluded, i.e. “ Uncle and Nephew.” 
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greyhounds, Barrington, Lee and Pearson found the ratio of parental to fraternal 
correlations to be *466 to ‘529, for the horse °522 to ‘633, for man °460 to ‘519 ; and 
TABLE V. 
Number of Tentacles of Pairs of “ Brethren” (Siblings). 


Number of Tentacles of First Sibling. 


























r) "ee Bee Bee oo | 10 | 11 Totals | 
a. | } | 
vo 
SP] 5 [- 5 | 23| 11 3| — |— 42 | 
gs| 6 5 | 1878 | 3599 | 1426 | 231 28) 7) 7174 
Siz | 7 | 23 3599 | 3968 | 4473 | 941 | 114 | 20 | 18138 | 
=A) 8 |11| 1426 | 4473 | 2880 | 723 | 107 | 12] 9632 
SU] 9 3| 231 | 941| 723| 254] 63/ 7] 2222 
» 8 | 10 f—| 28] 114 | 107} 63} 10) 1) 333 
rie 11 — | 7 | 20 12 7 1) — 47 
R | Totals | 42 | 7174 | 18138 | 9632 | 2299 | 323 | 47 | 37578 | 
| S ; ~ : I | 


Barrington and Pearson for shorthorns ‘40 to ‘53. Again Warren found for mean 
results for Hyalopterus and Daphnia ‘500 to ‘656. It will accordingly be clear 
that the ratio of fraternal to parental correlation lies between 11 and 13. If we 
suppose the weakening of heredity to be as great in the filial as the parental 
generation, we should expect fraternal resemblance in Hydra to lie between 
11 x *136 and 1°3 x ‘136, or say between ‘15 and ‘18. Our results are: 


(37,578) Members of same Co-fraternity* Correlation 
Mean ... een dies 7-212 


Standard Deviation ... aa "167 + 003 


The correlation of “brethren” is thus significantly greater than the corrected 
correlation, ‘135, of parent and offspring, the actual ratio being 1:23. It differs 
also significantly from the relation between “uncle” and “nephew,” the ratio in 
that case being 1‘76, which compares closely with the value for man ‘50 to ‘30, 
ie. about 1:67. It appears impossible therefore, from the three tests made (i.e. 
comparison of the parental correlation with the correlations of individuals with 
their grandparents, with their “uncles or aunts” and with their brethren) not to 
admit that the individual has a special relation to each grade of its line, and 
is not equally related to all members of its line. 


Of the results we have reached that for the foundation stock and their grand- 
children (03) is, perhaps, the least satisfactory; it is so low. But the source of 
this is fairly obvious. Out of each array of offspring of the 26 original Hydra, one, 
two or three, in one case only, four, individuals were selected to carry on the line. 
These individuals were not selected at random, nor were they taken proportionally 
to the frequency distribution of the tentacles in the offspring. The result is that 


* Pairs of ‘brethren ” in the first generation. 
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the grandchildren as they stand are not random samples of the grandchildren 
which would be due to the original individual. They are arbitrary and not properly 
representative groups. Thus the, correlation of the foundation stock of 26 with 
their 1452 great-grandchildren is ‘059, greater than the value, ‘030, of the same 
individuals’ correlation with their grandchildren. It may readily take any small 
value, because the grandchildren and great-grandchildren are not typical of the 
stock of the foundation individuals, there having been either one or two selections 
of two or three individuals of a quite arbitrary character in between. While 
therefore the correlations for these more distant generations are lower than in the 
case of parent and ofispring, they are quite erratic, and I see no obvious method of 
allowing theoretically for these arbitrary intermediate selections of parents. It is 
safest to base the fundamental criticism on the result that the correlation of 
brethren is sensibly higher, and of “aunts and uncles” very sensibly lower than that 
of parents and offspring, a result absolutely opposed to the theory of the pure line. 


(6) But if those who lightly accept without criticism such results as those 
of Hanel have not the statistical training necessary to measure degree of resem- 
blance, they might at least have examined Hanel’s own tables carefully, and com- 
pared the average number of tentacles in the offspring of the parents possessing 
the highest and lowest numbers of tentacles which occur in each pure line. They 


| Lowest Number | |Highest Number| Number | 





“‘Stammthier” | Line | of Parental | pone eg Mean | of Parental of | Mean 
| ‘Tentacles | ae | Tentacles Offspring | 
6 1 6 83 6°82 7 65 7°04 
6 = 6 66 6°84 8 13 7°46 
5 Q> 5 30 6°70 8 29 7°34 
8 3 6 143 7°42 8 199 7°43 
7 4 7 93 7°60 9 64 7°62 
6 5 6 202 6°69 8 77 715 
9 6 6 183 7°06 12 35 7°69 
7 7 6 32 7°03 10 31 7°66 
7 8 7 100 6°67 8 23 6°78 
8 9 6 22 7°32 8 101 741 
8 10 6 39 7°39 8 116 7°22 
6 11 5 39 | 6°90 7 85 711 
6 12 6 154 6°97 8 83 7°05 
7 13 6 89 6°35 8 50 6°36 
7 14 6 24 6°50 8 18 6°61 
8 15 7 21 6°93 9 56 7°73 
12 16 8 84 7°08 12 25 7°64 | 
10 17 8 31 8°23 10 116 8°45 | 
7 18 6 119 7°34 9 49 7:35 | 
5 19 5 49 7°06 9 26 | 7:30 | 
7 20 6 95 7°47 9 18 | 7:44 
6 21 6 103 7°49 8 39 811 
7 22 7 109 7°67 10 23 7°12 
5 23 5 44 7°22 9 85 7°40 
7 25 7 98 8°05 10 60 | 7°63 
9 26 7 “ | TR 9 78 7°25 
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The above Table has been prepared from Hanel’s means, because these are 
what Jennings had vefore him. Many of these means are, I have found, erroneous 
to a greater or less extent. Further, in a number of cases Hanel did not preserve 
as parents individuals with the highest and lowest number of tentacles occurring 
in the “pure line.” But what do we perceive even in the data thus presented? That 
although the character is very slightly inherited and largely the product of other 
factors than heredity, yet if we reproduce from the individuals in each pure line 
with (a) the lowest and (b) the highest number of tentacles available, the produce of 
the latter class has in 22 out of the 26 “pure lines” more tentacles than the produce 
of the former class. 


In other words, so far from Hanel’s observations demonstrating as that author 
and Jennings suppose that: 


(1) “In einer Population von Hydra ist die Selektion wirksam, innerhalb der 
reinen Linien ist sie ganz ohne Einfluss. 


(2) Im Falle 1 ist die Regression eine teilweise, im 2 Falle ist sie vollstandig,” 


they actually show that regression exists within the “pure line,” and that as far 
as Hydra is concerned the idly termed “fluctuating variations” have a real selective 
value. The “erfreuliche Uebereinstimmung mit Johannsen’s schénen Resultaten,” 
which Hanel says the experiments indicate, is real, but far different in its bearing 
from what that author imagines. The Phaseolus experiments of 1903 and the 
Hydra experiments of 1908 both alike indicate that the offspring are closer to the 
immediate parent than to other members of the same “pure line.” Yet they 
have both alike been accepted without criticism by distinguished biologists who, 
suffering from “neo-chytophilia,” seem peculiarly anxious to remove the only 
philosophical basis of Darwinism as a memorial of its fiftieth anniversary. 


(7) Let us recognise fully what the “pure line” theory leads to in the case 
of any self-reproducing organism. It assumes that the cells of an individual 
destined for the production of new organisms are all alike in character and that 
variety in the somatic characters of such new organisms, when resulting from 
a single individual, is not due to any differentiation in the groups of the cells 
from which the fraternal individuals arise. Every self-reproducing species must 
have originated with an indefinite number of pure lines—differentiation could not 
have been produced by selection—or else, differentiation must have been produced 
at some epoch by what has been termed a fit of mutation. Either solution, the 
initial indefinite differentiation of a species, or the epochs of mutation, appears to 
me to destroy Darwinism as a consistent logical system. What then? Simply, 
that if Darwinism be proved to be in error it must be dropped. But the proofs 
which are being accepted to-day, that it is in error, are no proofs at all. They 
are hurried conclusions drawn from observations, which, when treated by any 
adequate logic, demonstrate—even on ill-suited material—the very reverse of what 
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they are assumed to prove. I fully appreciate the careful experimental work done 
by both Jennings and Hanel; I am not pledged to any theory of life, or of inherit- 
ance; it isthe method and logic of their reasoning which concerns me in the first 
place. These are essentially inadequate and do not meet the points really at 
issue. That the degree of resemblance decreases in a geometrical progression, 
i.e. that offspring are more like their parents than their grandparents or great- 
grandparents, is not for me a theory, it is a deduction from the observed 
facts; it becomes a theory, which flows from an hypothesis, if you accept, say 
Mendelism*. Those facts would not be changed, if the theory of pure lines were 
demonstrated for either Paramecium or Hydra. But the establishment of that 
conclusion would affect biometricians in the indirect manner that any fundamental 
subversal of Darwinian principles destroys the only philosophical theory of life at 
present available, and so lessens the interest many feel in devoting time and 
energy to the measurement of life. 


(8) As I have taken Hanel to illustrate recent biological work on the in- 
heritance of variation, I will consider in this section another biological memoir, 
one on fertility. 


I have hitherto made no reference to a recent interesting paper+ by Captain 
R. E. Lloyd, LM.S. He states that he was led by some observations of mine on 
the fertility of Shirley poppies to question whether fertility was related to weight 
in rats. He gives two correlation tables which I reproduce below, but he makes 
no statistical reduction of these tables, but draws apparently from the mere 
examination of them the conclusion that gigantic and dwarf rats are just as 
fertile as common rats of average size. “There is clear evidence that the largest 
and smallest rats are quite as fertile as those of average size” (p. 264). 


Now the first impression I formed from Lloyd’s tables before they were reduced 
was that they showed quite sensible correlation between weight of rat and number 
of young. And this on reduction proved to be the case, the dwarf rats have fewer 
offspring than the average rat, and the large rats have more offspring than the 


* A curiously ignorant account of the biometric treatment of heredity has recently been given 
by W. Weinberg: ‘‘ Ueber Vererbungsgesetze beim Menschen,” Zeitschrift fiir induktive Abstam- 
mungs- und Vererbungslehre, Bd. 1. 1909, 8. 377 et seq. He does not appear while writing his paper to 
have known the difference between the correlation and regression coefficients ; and only when penning the 
last paragraph did a vague inkling of the difference come to him—through the writings of Johannsen ! 
He overlooks entirely the allowances for assortative mating in biometric work, and accuses the biometric 
school of neglecting environment, when at any rate in the case of man it is the only group, which 
by statistical method and by direct investigation has endeavoured to allow for it. Weinberg has stated 
that the correlation between parents and offspring observed by biometricians in the case of the 
phthisical diathesis is due to a mixture of classes having different tuberculosis death-rates ; and this 
regardless of the fact that (i) the data of the biometricians were purposely drawn from very uniform 
classes, and (ii) the correlation of husband and wife for the same material was shown to be much 
smaller than that of blood relations and in some cases zero. It hardly seems needful to reply to 
criticisms of this character. 

+ “The Relation between Fertility and Normality in Rats,” Records of the Indian Museum, Vol. 11, 
Part m1. pp. 261—5. 
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TABLE VI. 
Poona Rats. 


Number of Offspring. 
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average rat. If w stands for weight, f for number of young, then we have the 
following results : 


, 1003 Poona Rats 1006 Belgaum Rats 


Mean Weight # =113°92 128°84 

Mean Number of Young f= 5°49 5°38 
Tw= 24°86 25°35 
o,= 168 1-29 
Twy= 1670 1498 
7 = 1733 "1634 


As the above results indicate, the regression curves of fertility on weight are 
almost linear, and the average number of offspring increase at an almost uniform 
rate from the dwarf to the giant rats. This is well illustrated on the accompany- 
ing diagram. It will be seen at once that Captain Lloyd’s view is not supported 





Number of Young. 
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Weight in grs. of Mother. 
Regression Curves of Number of Young on Mother’s Weight in Rats. 
AA Poona Rats. BB Belgaum Rats. 


by his own data, There is a very sensible relationship between the weight of the 
rat and the number of its offspring. But it is not a relation in which the type rat 
is the most fertile. In other words the case seems at first sight precisely one in 
which the effect of reproductive selection would cause, without regard to natural 
selection, a progressive increase in the size of rats. But is this inference really 
justified? It seems so very important to determine whether any such cases of 
genetic selection actually occur that we may be pardoned for hoping that further 
information may be forthcoming. It is conceivable that in certain stages of 
pregnancy even the size of the litter might have some effect on the weight, but 
this is hardly likely to produce all the difference noted. Another important point 
49—2 
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is the question of age. Were the bigger rats on the average the older, and do the 
older rats have on the average larger litters? If anything of this kind takes place 
in rats, we should not be comparing the fertility of large and small rats, but the 
fertility of rats in different stages of development, which is a very diverse point. 
I am inclined to think that the real relation between number of young and weight 
of rat, although overlooked by Captain Lloyd, actually has this simple origin. It 
is suggested, but not demonstrated by the relation which is manifest in mice 
between size of litter and number of litter. Thus in 700 litters of mice recorded 
by the late Professor Weldon, I find : 


Number Average size 


First litter fy. ao 307 5°46 
Second _,, ite a 254 557 
Third , wat sts 139 5°76 


It will be seen that the size of the litter increased uniformly with the increasing 
age of the mouse, and there is little doubt that the weight of the mouse also 
increased with age. Only a direct experiment in which age and number of litter 
were taken into account could settle the point. In Captain Lloyd’s experimental 
weighing, there was no record of these characters. I have small doubt that 
the very sensible correlation between weight and number of young shown by his 
data is only a secondary result of the relations between age and weight, and again 
between age and number of young. It is not evidence of a real correlation between 
fertility and a somatic character, and thus has no bearing on genetic selection, and 


does not oppose the general principle that fertility is not highly correlated with 
inherited characters. 


(9) The reader may ask: Why have I considered in the same paper such 
diverse work as that of Jennings, Hanel and Lloyd? The reason lies in this: 
We have been told recently in an ex cathedra fashion by a distinguished biologist* 
that : 

“ Of the so-called investigations of heredity pursued by extensions of Galton’s 
non-analytical method and promoted by Professor Pearson and the English 
Biometrical School it is now scarcely necessary to speak....... A preliminary 
acquaintance with the natural history of heredity and variation was sufficient to 
throw doubt on the foundations of these elaborate researches. To those who 
hereafter may study this episode in the history of biological science, it will appear 
inexplicable that work so unsound in construction should have been respectfully 
received by the scientific world.” 


It seems therefore, however regrettable, needful to re-emphasise the point from 
which this Journal started more than eight years ago. Biology requires the help 
of a more exact logic than it appears to possess, above all of an adequate statistical 
theory to enable it to interpret its observations and test its own theories. I have 


* W. Bateson: Mendel’s Principles of Heredity, Cambridge, 1909, pp. 6—7. Luckily the doctrine 
of infallibility is not yet accepted in the world of science. 
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endeavoured to illustrate this in the present paper, by indicating how a want of 
statistical insight has resulted in at least two cases where experiments have been 
interpreted in a sense, which is in no way justified by the observations themselves. 
Regression within the pure line exists in Hydra, and fertility is associated with 
weight in rats. These are far from the only cases in which principles bearing on 
evolution have been propounded on the basis of researches, which prove in fact 
their negation. The biometrician does not desire to waste the energies that may 
be given to better work in personal recriminations with anonymous writers, nor 
in controversy with biologists of repute; but it is desirable to assert occasionally 
that biometry is very far from moribund, and that biology stands to-day even more 
in need of its methods than it did eight years ago; for it has now developed theories 
of such complexity, that without the aid of the highest mathematical analysis, it 
is wholly unable to state whether its theories are accurate or not. And this many 
Mendelians admit to-day, and all will have to admit in the very near future. From 
small fragments our Frankenstein has created his soulless monster; he will need 
all the mathematical aid he can enlist, if his creation is to be brought under control, 
and not to end in destroying him. But I propose to deal further with some 
recent Mendelian developments on another occasion. 











ADDITIONAL TABLES AND DIAGRAM FOR THE DETER. 


MINATION OF THE ERRORS OF TYPE OF FREQUENCY 
DISTRIBUTIONS. 


SECOND PAPER. 
By A. RHIND, B.Sc. 


In Biometrika, Vol. vil. page 131, a diagram was given to determine the type 
to which an observed frequency distribution belongs for given values of 8, and f,. 


The probable errors of 8, and 8, define a region within which the most 
probable values lie. 


Since, however, 8, and , are highly correlated this area is not a rectangle but 
an ellipse ; a contour of the surface 


N _1 1 fa _ 2Rep.zy, y? 
aa AE ry ys e 21-Ripp, (o%4, 6, %, ay 
2mrop,o5, V1— Rss, 
or, referred to principal axes, 
1 4X2. ¥? 
= = =f ? = + = (A). 
2rz,>. 2) eee werreenserereeseesseeser 


The semi-axes of an elliptic cylinder, with section similar to a contour of (A), 
and containing half the frequency, are 11772, and 1°177%,. 


These are inclined to the axis of 8, at angles 6 given by 


tan 20 = a LT = Rg», tan 2¢, where tan ¢ = o¢,/¢5,. 





os, o's, 


The directions were calculated from this formula and Diagram C (p. 389), 
constructed in such manner that the tangent and normal to one of the broken- 
line curves at any point are the major and minor axes respectively. The major 
axis is inclined to the axis of 8, at angles given by Table IX. 
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A distinction between the directions of the major and minor axes is readily 
made when we note that a contour of (A) is inscribed in a rect: . '* of sides 2e,, 


and 2o,,, and that the major axis of such an ellipse is inclined to tue greater side 
at an angle less than 7/4. 


For if aa? + 2hey + by? = 1 be an ellipse inscribed in a rectangle sides 2g and 2p 
having the greater side 2q parallel to the axis of #, and centre as origin, then 


hae” OG a." a. k* ee 


Again a cos? @ + h sin 20+dsintO= 7, ‘ 
: 2dr 
= 9 =—— 
(b — a) sin 20+ 2h cos 26 7 dO" 
For maximum or minimum r 
— 2) 
tan 26, = b —e 
: 2 @r : Se 
2 (b— a) cos 26, — 4h sin 20, = — — 16 at a maximum or minimum, 
1: 
xn 2 Ds 
a con 20, © aPv+4h? 2 d& 


(b-a)~=— 2 d&* 
dr. : : oe 
Hence 1: '8 negative when cos 26 is positive, 


.. @ for a maximum is < 7/4. 


2 V1 — Rs, 78,78, 


If sin y= 
Y o*g, + o°., 
then >, = Vo%, + o%,, sin $y, 


> s _ Vers, + o"s, cos by. 


From these formulae =, and =, were calculated and the values of 11773, and 
11772, tabulated in VII. and VIII. respectively. The number must of course be 
multiplied by the factor -67449/VN. 


Since the population outside an ellipse of semiaxes «>, and «>, is Ne-*’, it 
follows that, if an ellipse of twice the above linear dimensions be drawn, the 
probability that a point lies outside it is 1/16. 


If an ellipse be drawn with semiaxes «>, and «>,, the probability P that a 
point lies outside the ellipse is P=e-**. Let 


‘67449 


ve 
q 


«>, =1177 VNS, x 
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Then when g=10, «* = 6'3006 and p= = approximately, 


1 
= 2— 7° = 
” ” q di 12, K 7 5609 » Pp 44 ” 


Hence to find the semiaxes of ellipses containing 22/23 or 43/44 of the whole 
frequency ; take the numbers from Tables VII. and VIII. and multiply by the 
factor from Gibson’s Tables (Biometrika, Vol. tv. p. 385) corresponding to popula- 
tions of V/10 and N/12 respectively. 


To facilitate the construction of the ellipse, a series of ellipses has been drawn 
on Diagram C of different dimensions and varying in ratio of ,/, from 1 to *3. 


In practice the values of the axes should be taken from Tables VII. and VIII. 
and multiplied by the appropriate factor. Then their ratio gives =,/>,. By means 
of a piece of tracing paper applied to the corresponding series of ellipses, the 
required curve can be approximately drawn and transferred to the diagram. It 
will then be possible to determine whether the frequency really lies with a definite 
probability within the chosen type, as indicated by the B,, A,. 


The series of tables has been extended by the inclusion of Table X. which 
gives the probable error of 
oe — Bi(Ba+3P 
~ (48, ow 3P;) (28, — 38, — 6) 
The s.D. was determined thus : 
1 2 1 458, — 38 288, — 38 
= tem 38 5 O- Ga an — aap, 6) 


2 amare : 88 
~ ((A:+8) (48,—38,) (28,—38,— 6 ; 
1 3 3 
‘ la * (48,— 88,) * (28,— 38, — a| - 
whence 
on, = Ce {A?e%,, + 2ABRaz», op,78, + Bio*,,}?, 
where 
ni __ (P+ 3) 
2 (48, — 38) (28, Na 3B, = 6) ; 
A= 8B. (Be re 3) — 9B,?, 
BS 4+ 8, (8, + 5)— B.(B, + 4) 
B=1 
SA (Bo +3) ; 


c 


Erratum. 1n the printing of my former paper a slip of sign escaped my 
notice. On p. 129, Biometrika, Vol. vit, |. 7, for a =(28,— 38, — 6)/(B,—3) read 
a=(28,— 38, —6)/(8.+ 3), which was, of course, the value used in the calculation 
of the tables and diagrams. 

















for) 
2) 
of} 


A. Ramp 


Diagram C. 
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TABLE VII. 


Values of 1:77V NX, for given values of B,, B, (Semi-Minor Axis 


of Probability Ellipse). 























No) FOO DADANPHH OOS ODA AM HF ODAAFEOPOMAH AE OO a | | 
> COSCO SOF HAHAH RRS OUI GIGI 9D 99 09 HH HS ID DI~~ODMS | 
r BOM DAOSHAMA DOE AOAMOEANOHASSSNLAAGoms | l 
6 | SS SSS HAAR AAS AAS HAAAAAHM HM HHDDOSOOH DOTA | 
ip | DOK DHROSOASAMAOHOErRASAMVMOEANHDAGAErVSHrOL ao) | | | 
ae SDDS Ot tt tt rt at GA GN GU GY GY 69 69 69 AH HID IDO I DM | 
bie | Cr RDPOAANAMDAOLHOAANAWODOMOD DHIr~AOAPNAM WA HO | 
= SSS SOR AAA RAR SRR AAA AN DHHS DOrDAamM AAS a 
> | SEOAPSAARGHSPHPLOASARPSOSODASEOPP SOSH | | | | 
‘ DSO Oa tt at tt GUT GY GN OD 09 89 HH HIG © 6 t~ | | 
rt I DROSHTAMAHSEDAAHAOARAWOH A DONAOR | | | 
* DOC OCR RB RAR BANANA NOD 09 0 HHI DOOD 
re) FDORDOBHAHAMANMOLrDOAHMOE-ANNQDAOGOAeK AAAS BB | | 
” SSS SR AA AAAS SAAN AT AANA HH DOMDD ! 
~~ | FOROS HTANMAHOKrDOSHHD Or OV OAHe ADO HAE | | = 
J DO NNN GN GN 9 69 19 OD AH HID DOI DD I he. | 
aa HDHHROSHNMRHO OMRHOAMVVErOMOCOWE OPEPLS | l | 
° | pA IE IE AB IE RLS AM Pi SE AR 2 Oran 
- DOHASSOAAAPPOHOO ROM MOSKOMSSOPOPV ASE SSA | 
= SD Ot tt rt tt tt GU GT GY GY GT OD 69 OD SH mT HID 1 OE 
ID | DHDHDODOAAMMNM AMD OMRAOAVOM-PRHORWOHSM | | Af en | 
| DDS tt at tt rt GT GU GT GT GT 0D 69 69 69 Ht HO | | 
| LORSSHAAMM AH OOSCHAANAOAALHOALN ; cht Pid 
: SOS INN 0D 09 OD HH ! ! 
“WY DOHAROOAAMMAHSOSOHOAOAHM OM SOMOOMD | 1 | | | | 
™ SOOO ee TT GUNN DGD HHH | ! 
a | Ir SOAAOSOHANM POSH ODOAMNHOMSOOT | } | te & 
* DD DOO at te tt GU GT GT GT 9D OD OD HH i i 
9 CEOOEPPPLEPOHASOANFOPSLON | | A&E | | | | | | | | | | | 
r pid i dics acta dy whe dh A goalesplblleecemma eee Btd ind ' ' 
| SOOSOSCSCSCSCSCSCOSOS 
SHARPS SH HASH RH HH SHL SASH NP MSH OSH HORST RHP SH eg 
Se Se Se Se RAV V NW So FH 49 89 9 99 99 F999 9D SHAS SPSS SSS SSH ]H 6 19 1H 1H 1H 1H 19H 


a 


Q 


6°6 
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TABLE VII.—(continued). 


Values of 1°77 VN, Sor given values of B,, B, (Semi-Minor Axis 
of Probability Ellipse). 


B, 















| | 
0°4| 03 0:3 0-2 0°1/ 071/00 0°0) 0-0 0-0 9°0| 00/00 0° 0-0) 2-0 
0°5 | 0°5 | 0-4 | 0-3 | 0-2 | 0-2 | O-2 | O-1 | 0-1 | 0-1 | 0-0 | 0-0 | 0-0 | 0-0 | 0-0 2-1 | 
| 0°6 | 06 | 0°5 | 05 | 0-4 | 0-4 | 03 | 03 | 03 OQ O'1 | 00 | 0-0 | 00 | 0-0 | 2-2 | 
| 0-7 | 0-7 | 0-6 | 06 | 0°5 | 05 | Ord | O'4 | 0-4 0-3 0°2 | OF] | O-1 | 0°0 | 0°0 | 23 | 
| 0°8 | 0:8 | 0-7 | 0°7 | 0°6 | 0°6 0-5 | 0° | 0-4 0-4 0°3 | 02} 0-2 071 0-0] 2-4 
0°9| 0°83 0:8 | 0:7 0°7| 07 0°6| 0°6 | 05 | 0-5 | 04 | 03 | OB | 0-2 Ol | 25 
1°0 | O°9 | 09 | 0°8 | O°8 | 0°8 | O°7 | O°7 | O°7 | O-G | O°G | O'S | O°4 | O°3 ied 26 
}1°1 | 1°70) 1:0 | 0-9 | 0-9 | 0-9 | 0°8 | 0°8 | 0°8 | 0-7 | O°7 | O°6 | O°5 | O°4 | O'3 | 2-7 
| LL) 11 11) 1:0 1:0 1:0 0-9) 0°9| 0-9 0-8 | 0°8 | 0°7 | O°6 | 0°5 | 05 | 2-8 
P12] 1-2 1-2) UL UL) 1-0) 1-0} 1-0) 1-0) 1-0 | 0-9 | 0°8 | O-7 | O°7 | O°7 | 2-9 | 
| 13 1°3 | 1-3| 1°2 | 1°S| 1°2| 1:1 | 1°1 | 1°1| 1-0 | 10} 0:9} 0-9 | 0°8 0°8 | 3°0 | 
| 1:4 141-4) 13 1B) 1B) 1-2 | 12) 1-2) 1-1 | LL] 1-0) 1-0 | 0-9 | 0-9) 3-7 | 
15 | 1°56 | 1°56 | 1-4 | 1:4] 1-4] 1-3] 1°3 | 1-2} 1-9 | 1-2] 171) 1-1) 10) 1:0) 3-2 | 
7) 16 16) 15 14) 104 | 13 | 13 | 1B) 1-2) 12] 12) V1) el 10} 3-3 | 
18] 1-7) 1-7 | 16) 16 | 16) 15 | 15 | 1-4 | 1-4) 13] 1B) 1-2) 1-2) V1) Sy | 
1°9)1°9 | 1°8 | 1°8 | 1°7 | 1°7) 1°6 | 16) 1°5 | 15 1-4| 1-4 13) 13) 1:2] SS | 
2°0 | 2°0 | 2°0} 1°9 | 1°8 | 1°8| 1°7 | 1°7 | 16 | 1°6 | 1°5 | 1° | 1-4) 14 1-4| 3-6 | 
2°2 | Q-2 | Q-1 | 2-1 | 2-0 | 2-0] 1°9| 1°8 | 1-7 | 1-7 | 16] 16] 1-6) 1:5) 15] 3-7 | 
2-4) 2-3 2-2) 22/21 21) 2-0) 2°0/ 1-9) 1-9) 18} 17/17/16 16! 3-8 | 
2°6 | 2:5 | 2-4 | 2-4 | 2-3 | 2-3 | 2-2 | 2-1 | 2-0} 2°0 | 1°9| 1°9| 1-8] 1°7 | 1:7] 3:9 
| 2°8 | 2°7 | 2-6 | 2°6 | 2°5 | 2°5 | 2-4 | 2-3 | 2-2 | 2-2 | 2-1 | 2-1 | 2-0 | 2°70) 1°9| 4-0 
3°O | 2°9 | 2°8 | 2°8 | 2°7 | 2-7 | 2°6 | 2°5 | 2-4 | 2-4 | 2B | 23) 2-2 / 2-2) Q1) 41 | 
| 3°2 | 3:2 | 3-1 | 3°0 | 2°9 | 2-9 | 2°8 | 2°7 | 2°6 | 2°6 | 2°S | 2-4 | 2-3 | 23) 22! 4-2 
3°5 | 3°5 | 3-4 | 3°3 | 3-2 | 3-1 | 3-0 2-9 | 28 2°8 | 2°7 | 2°6 | 2-5 | 2°4 | 2:4] 4-3 B 
3°9 | 3°8 | 3-7 | 36) 35 3-4 33) 31/30 3-0 29) 28) 27/27 26) Ay | 
| 4:2) 4-1 | 40 | 3:9 3°'7 36 | 3°5 | 8:4) 3°3 | 3-2 31) 31) 30 380 29) 45 
| 4°6 | 4°5 | 4°4 | 4-2 | 4-1 | 3:9 | 3°8 | 3°7 | 3°5 | 3-4 33 | 3°3 32/32 31) 46 | 
| 51 | 4°9 | 4°8 | 4°6 | 4-4 | 4-3 | 4-2 4-0 | 3-8 3°7 | 3°6 | 3°5 | 3:4 | 3°4| 3°38] 4-7 | 
5°6 | 5°4| 5:2 | 5°0 | 4°8 | 4°7 | 4:6 | 4°4| 4-2 | 4:0 | 3°9 | 3°8 | 3-7 | 3°6 | 35 | 48 
62 | 6°0 | 5°7 | 5°5 | 5°3 | 5°1 | 5:0 | 4°8 | 4°6 | 4:4 4-2 | 41 4:0 | 3:9 | 3°8| 4:9 | 
6°8 | 6°6 | 6-3 | 61 | 5°8 | 56 | 5:4 | 5°21 5-0! 4:8 4°6 | 4-4 4°3| 4:2) 41) 5-0 
7°4| 7°2| 7-0 | 6°7 | 6-4 | 6-2 | 5°9 | 5-7 | 5:4 | 51 | 4°99] 4°7 | 4°6 | 4:4 | 4:3) 51 
8°3 | 8:0 | 7-7 | 7°4 | 7°1 | 6°8 | 6°5 | 6°2 | 5°8 | 5°5 ad hte 4:9} 4:7) 4°5 | 5:2 
9°2 | 8°9 | 8-6 | 8-2 | 7°8| 7-5 | 7-1 | 6°7 | 63 | 5°9 | 5°6 | 5°4 | 5-2) 5°0 | 4°7| 53 
10 | 10 | 9°6 | 9°1 | 8°6 | 8:2 7°8 | 7°3| 6-9 | 65 | 61 | 5°8 | 5-6 | 5-3 5-0] Sy 
12 | 11 | 11 | 10 | 9°5 | 9°0| 8°5 | 8:0 | 7°5 | 7-1 | 6°7 | 6°4 | 6-1 | 5°8 | 5°5 | 5°5 
— | 12 | 11} 10} 10 | 9°5 | 8-9} 8-4 /| 7-9 | 7:4 | 7°0 | 6°6 | 6-3 | 6-0 | 5°6 
14 | 13 | 12 | 11 | 10 | 10 | 9-4! 8°8| 8:3 | 7°8| 7-4] 7:0 67 | 57 
16 14/13) 12 12/11 | 10 | 10 | 9°4| 88/83/78 | 7-4] 58 
— |} — | 18/ 16/ 15 | 14] 14 | 138 | 12 | 11 | 10} 10 | 9°5| 8-9) 8-4) 5:9 
| 22 | 20/| 18 | 16 | 15 | 14 | 13 | 12 | 12 | 11 | 11 | 10 | 9°5| 6°0 
_ — | 18} 16] 16 | 14] 13 | 13 | 12)| 18 | 1 | OF 
—|- 21.19/17 16 15 | 14) 13 | 18 =e 2 
o4.| Si} 1 8 | 17] 16] 15 | 14] 13| 63 
- in 7 24 | 92 | 90 | 19 | 18 17. 16 15 | 64 
- 32 | 28 | 25 | 23 | 21 | 20 | 19 | 18 | 16 | 6:5 | 
- ey ees |p =a OG | 
— - - 6°7 
= a 68 | 
—}— —|—|- = 6-9 
— | — — - 70 
50—2 
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Probable Errors of Frequency Types 


TABLE VIII. 


Values of 177 VN 3, for values of B,, Bs 
of Probability Ellipse). 


A, 


(Semi-Major Axis 


























0| +05 +1 15 | 2 | 26) 3 | 35 | 4 | 45 | 
oho oo 
1-6| 1-9| 2-2 25 | 2°8| 3-1 3°4 | 3:7 ail aa 
17 | 21 | 2-4 | 2-7 2-9) 3-2 3°5| 3-8) 41 |"4-4 | 
V9) 2°3 | 26 | 2-9 | 3-1) 3-4 3-7 | 3-9 4-1 | 4-4) 
2°1| 2°5 | 2°8 | 3-1 | 3°3/ 3-5 | 3-8 | 4-0 | 4-2 | 4-4 | 
2-4) 28/31 | 3-3 3°5| 3°7 | 39 | 4:1) 4:3 | 4-5 
2°8) 3:3) 35/37 38) 4:0) 4-1) 43/45 | 4-7 
33/38) 40/42 4:3) 4:5|4-6| 4-7 | 48 4:9 | 
3°8| 4°4 | 4°6 [4-7 | 4-9/ 5-0) 5-1 | 5-2 | 53] 5-4 | 
4°4| 5-0 | 5:3 | 5-4 asi ties 59 5-9| 5-9 
5'1| 5°7 | 6-0 | 6-2 | 6°3| 6-4 | 6-5 | 6°6 | 6-5 | 65 | 
5°8| 65 | 6-9 | 7-1 | 7-2| 7:3 | 7-3] 7°3| 7-2 | 7-1 | 
6°7 | 7°5 | 8°0 | 8°3 | 8-4| 8-4 | 8-4 8°3 | 8-2 | 8-0 
7°8| 8:7 | 92 |9°6 9:8| 9°8 | 9°7 | 9-5 | 9-4 | 9-2 
92/10) IL} 11) 11) 11) a1} 11} 1} 11) 
11 | 11) 12/12/13) 13] 13] 12] 12/ 12 
13| 13 | 14|14/15| 15} 14 14| 14] 14 
15 | 16 | 17] 17 | 17] ay 16| 16) 16) 16 
18 | 19 | 20] 20 | 20| 20 | 19] 19] 18) 18 
21 | 23) 24) 24 | 24) 23 | 92| 22) 21/ 21 
25 | 27 | 28) 28 | 28| 27 | 26) 25| 94| 24 
29 | 32) 34) 33 | 32| 31 | 30) 29| 28| 27 


=~ | -- | — | 200] 160| 135 | 119 
5 ; Te — | 384 | 234! 182! 153 




















5 | +55 
4:7 50 
46 4:9 
46 4:9 
46 4:9 
|4°7 4:9 
4:8 5-0 
50! 5-1 
5°4 Bra 
58 | 5:8 
6:4 | 6-3 
7:0 6-9 
7978 
9-0 | 8-8 
10} 10 
11] 11 
13) 19 
15| 14 
17| 17 
20} 19 
23) 22 
26| 24 
30| 98 
34/ 32 
40| 37 
46| 43 
53| 49 
61| 57 
72) 67 
&6| 79 
105) 95 
130 115 





OWIWIAADAUAM AN 
AAWHDHDUHNHeH ONO 


_ 
—_ 


DW BD TD DD DD HD DD HD D 


— ° 
Se oece Looe 


5°7 | 61 
5°6 | 6-0 
5° | 5-9 | 
5°5 | 5°8 
B:4 | 5°7 
5°4 | 5°7 
55 | 5°7 
56 | 58 
5-9 | 6-0 
6°3 | 6-4 
6°8 | 6°8 
7°6 | 7°5 
8:4 | 8-3 
9-4 | 9-2 
11| 10| 
12) 11 
14| 13 
16 15 | 
18} 17| 
20) 19) 
22| 21 
25| 24] 
29} 27) 
33| 21] 
38 | 36 
43| 41 
50| 48 
58| 55 
68| 64| 
79| 74) 
92} 85} 
110/101 | 


| 166 | 146 | 


203 | 177 
297 | 230 


Coe wor 





~ 
— 
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TABLE. VIII.—(continued). 


Values of 1'77V NS, for values of B,, B, (Semi-Major Asis 
of Probability Ellipse). 


Bi 


| x 7 | | 
| 8 | 85 | “9 | 1:0 10s| 11 |1-15 








1-4 | 145) 15 


























68|72|76|79| 8:3 8-7 $1 es )es) o) Hl 2 11| 12! 12/20 
6-7 | 7-0 | 7-4 | 7°8 | 8-1 | 85 89 |9°2}9:6| 10 | 11] 11] 11] 11] 12] 21 
6-5 | 68 | 7-2| 7-6 | 8-0) 8:3) 8-6 | 9-0) 9-4/9°8| 10| 10) 11) 11} 12) 22 
64 67 | 71 7°4|7°8 | 8-1| 85 |88|92/96)99| 10] 11| 11) 12) 23 
6°3 | 6°6 | 6-9 | 7°3| 7-6 8-0! 8:3 | 87/90) 9-4|98) 10) 10) 11) 11) 24 
6-2 | 6-5 | 6°8 | 7°1| 7°5|7°8| 8-2 | 8-5| 89/92/96} 10] 10} 11] 11) 25 
6-2 | 65 | 68 | 7-1) 7-4) 7:7) 80 | 83] 8-7 9-0) 9-4) 9-9} 10) 11 11) 26) 
6°3 | 6°5 | 6°8 | 7°0| 7°3| 7°6| 7-9 | 8:2| 85 | 8-8|9-2|9°7| 10) 10| 11] 27 
6:4 | 6°6 | 6°8 | 7°0 | 7-2 7°5| 7-7 | 8°0| 8-3 8-7 | 91) 95/98) 10) 10) 238] 
6°6 6-7 | 6°8 7°0 | 7°2|'7°4| 7°6 | 7°9| 8-2 8-6 | 8°9 93/96) 10) 10) 29 
6°9 | 7°0| 7-1 | 7-2| 7°3| 7:5 | 7°7 | 7°9 | 8-2 | 8-5 | 8:8 | 9:1] 9-4|9°7| 10) 3-0 
7°3 | 7°3| 7:3 | 7:4) 7°5 | 7°6| 7°8 | 8-0 | 8-2 | 8-5 | 8-7 | 9°0 | 9°3 | 9°6 | 9°9 | 3-7 
| 7°9| 7:9 | 7°8| 7°8| 7°8| 7°9| 8-1 | 8-2 | 8-3 | 8-5 | 8-7 | 8-9 | 9-2 | 9-5 | 9°8 | 3:2 
8°7 | 8-5 | 8-4 | 8:3 | 8-3 8:3] 8-3 8-4 | 8:5 8:7| 89/91/93 95/97 | 33 
9°5 | 9°3| 9-1 | 9:0| 8-8 | 88| 8-8 | 88/89 9-0| 9-1 | 9:2| 9-4 95 9°7 | 34 
11| 10| 10|9:8|9°6 | 9°5)| 9:4 |9°3| 9:3. 9°3 9°3| 9:4 9:5 96 | 9°8| 35 
| 12| 11] 11] 11] 10] 10; 10] 10] 9-9 | 9-9 | 9:8 | 9-7 | 9-7 | 9°8 | 9°9 | 36 
13] 12] 12) 12] 11] 11] 11] 11] 11; 10] 10] 10] 10) 10) 10/37 
15| 14| 14| 13| 13] 13} 12] 12| 12) 11] 11] 10] 10) 10) 11) 38 
17| 16] 16] 15| 14| 14] 14] 13] 13| 12| 12) 11] 11/ 11 1 9 
| 19} 18] 18 17 16| 16| 15| 14] 14) 18] 13] 12] 12] 12| 12] 40 
| 22| 21) 20) 19] 18) 17| 17| 16] 15| 15| 14] 14) 13) 13) 13) 41 
| 24] 23| 92) 21] 20| 19| 18] 18| 17) 16| 15| 15] 14| 14] 14| 42] B, 
28) 26] 25] 23] 22] 21) 20] 20) 19] 18] 17] 16] 15) 15] 15 ip 


DS Cir So 











doe de de te de te te te te te Se Oo Se 
Oo ~2 : 


D Cr TH ST GS Or Sr Sr Or St Sr 
BAVA Ow SOOM 


Co SO 


or 
© 
o 
Oo 
to 
_— 
© 
_ 
o> 
cs 
Ow 
c= 
= 
w 
No) 
w 
4 
on 
w 
ww 
w 
to 
As 


| 136 | 123] 112) 102] 95| 87| 80| 75) 70 9| J 
167 | 147 | 132] 119] 108|100| 92) 85) 79° 74) 69| 64) 60) 57) 54 

| — 160|141|]126|115| 105| 96| 89 83) 78| 73| 68) 64| 60 

— | — | 206| 169] 148}132) 120|110|102, 95| 88| 82) 76) 72| 67 

| — | — | 258 | 206 | 175 | 150 136 | 126/116 108|100| 93] 87) 81| 75 

| — | — |318| 255 | 215 | 190 | | 150 | 25 ¢ 

| - 446 | 332 273 | 228| 200) 178) 161 147 | 134/123] 113) 104| 98 


-_ 
for) 
i 9) 
a 
or 
x 
Oo 
— 
w 
fon) 
— 
~) 
Co 
_ 
_ 
i) 
_ 
o 
=1 
< 
=) 
© 
bo 
io) 
SS) 








480 | 364 | 2 
a ie 680 | 477 | 
oe) yew Se es | F087 680 











Ee Pe — | —|—]|—| — | 412} 968] 320] 63 | 
ot ae Se et ee — | — | — | — | 525 | 446 | 390| 6-9 | 


ies feat fag tee fart as ec be — |809 — ba 
| | 


is — |—|—|—|—| —|338| 297 /| 266] 67 | 
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TABLE IX. 


between Major-Aais and Axis of B, (Probability Ellipse) 
measured in degrees. 


A, 





2 | 25 

28 | 31! 33 
25 | 28 | 30 
23 | 26 | 28 
23 | 25 | 27 
20 | 23 | 25 
18 | 21 | 23 
17 | 20 | 22 
16 | 19 | 21 
15 17 | 19 
14 | 16} 18 
13 | 15} 17 
12 | 14| 16 
12 | 14) 16 
11 | 13 | 15 
10 | 12 | 14 
9/11) 13 

8 | 10) 12 

71 8) 2 

7 9/10 

6 8 9 

6 | 7 te! 

s 6 7 

4 5 6 

4 5 5 

3| 4 5 

2 3 4 


~t 
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TABLE IX.—(continued). 


Angle between Major-Awis and Axis of B; (Probability Ellipse) 
measured in degrees. 


B, 




















| | } | | 
| -8 85 | ‘9 96 | 10 |105 1-1 | 115) 12 | 1°25) 1°83 se 1h \1-45) 1°5 | | 
| 43 | 43 | 44 | 44| 45 | 45 | 46 | 46 | 46 | 46 | 47 | 47 | 48 | 48 49 | 2-0 | 
42 42 | 43 | 44 | 44 | 44| 45 | 45 | 45 | 46 | 46 | 47 | 47 | 48 | 48 | 2-1 
| 41 | 41 | 42 | 42 | 43 | 43 | 44 | 44 | 45 | 45 | 46 | 46 | 46 | 47 | 47 | 2-2 | 
| 40 | 40 | 41 | 41 | 42 | 42 | 43 | 43 | 44 | 44 45 | 45 | 46 | 46 | 47 | 23 | 
1 39 | 39 | 40 | 40 | 41 | 41 | 42 | 42 | 43 | 43 | 44 | 44 | 45 | 45 | 46 | 2-4 | 
| 38 |.38 | 39 | 39 | 40 | 40 | 41 | 41 | 42 | 42 | 43 | 43 | 44 | 44 | 45 | 25 | 
| 37 | 37 | 38 | 39 | 39 | 39 | 40 | 40 | 41 | 41 | 42 | 42 | 43 | 43 | 44 | 26 | 
| 36 | 36 | 37 | 38 | 38 | 39 | 39 | 40 | 41 | 41 | 42 | 42 | 43 | 43 | 44 | 2-7 | 
134 35 | 36 | 36 | 37 | 38 | 38 | 39 | 40 | 40 | 41 | 41 | 42 | 42 | 43 | 2-8 | 
| 33 | 34 | 35 | 35 | 36 | 37 | 38 | 38 | 39 | 39 | 40 | 40 | 41 | 41 | 42 | 2-9 | 
| 32 | 33 | 34 | 34 | 35 | 36 | 37 | 37 | 38 | 38 | 39 | 39 | 40 | 40 | 41 | 3-0 | 
| 30 | 31 | 32 | 33 | 34 | 35 | 35 | 36 | 37 | 38 | 38 | 39 | 39 | 40 | 40 | 3-1 
| 29 | 30 | 31 | 32 | 33 | 34 | 34 | 35 | 36 | 37 | 38 | 38 | 39 | 39 | 39 | 3:2 
| 98 | 29 | 30 | 31 | 32 | 32 | 33 | 34 | 35 | 36 | 37 | 37 | 38 | 38 | 38 | 3-3 | 
26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 36 | 37 | 38 | 38 | 3-4, | 
25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 35 | 36 | 36 | 37 | 35 | 
24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 34 | 35 | 35 | 36 | 3°6 | 
a2 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 31 | 32 | 33 | 33 | 34 | 34 | 35 | 37 | 
21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 33 | 34 | 3-8 | 
| 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 32 | 33 | 39 | 
19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29-| 30 | 31 | 31 | 32 0 | 
11s | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 30 | 31 | 42 | 
1 i7 118 | 19 | 19 | 20 | 21 | 22 | 23 | 25 | 26 | 27 | 28 | 29 | 30 | 30 | 4-2 | 
16 | 17 | 18 | 19 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 2% | 28 | 29 | 4-3 | B, 
15 | 16 | 17 | 18 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 “Fas 
114 | 15 | 16 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 2 5 | 
7 6 





bo 
oo 
Oye de de de te SS ve 


| "9 | 10 | 11 | 12 | 13 | 13 | 14| 15 | 16 | 17 | 18 | 19 | 20 | 20 | 21 | 
| sg! 9/10/11 | 12| 13 | 13 | 15 | 16 | 16 | 17 | 18 | 19 | 20 | 20 | 5-2 | 
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MISCELLANEA. 


I. Dr Galloway’s ‘‘Canary Breeding.”’ 
By C. B. DAVENPORT. 


In a paper published in Biometrika, Parts I and II of Vol. vi, Dr Galloway has criticised a 
“recent memoir” by me, “especially the material used” and the “use of terms.” Criticism 
is always to be sought for by scientific workers and is usually beneficial to the progress of science ; 
and, just as criticism is good, a rejoinder is often helpful to afford the original author, in the 
light of that criticism, opportunity to state any necessary change of conclusions or to make 
clear any difficulty which the critic has encountered. Accordingly I feel it to be my duty, 
although a not altogether agreeable one, to take the time and space for a rejoinder. 


The principal points of criticism that Dr Galloway makes are (1) that I should “have 
selected the original stock with much greater care” (p. 2); (2) that my “sole criterion of a 
perfect crest...seems to be absence of a bald occiput” (p. 2); (3) that “yellow does not mean 
merely absence of dark feathering, as in Davenport’s paper, but a particular quality of feather 
which may occur in a green, variegated or clear bird” ; (4) that my statement that hybrids with 
the yellow canary “ frequently show more or less of the canary yellow” is to be contradicted ; 
and (5) that in my green x “ yellow” matings, my “green” is not green but variegated and my 
“ yellows” are also variegated. Besides these criticisms there are some points of difference in 
experimental results, namely ; (a) in the offspring of “ variegated” x clear green, or variegated 
(pp. 3, 4, 26), (b) in the mating of two crests, and (c) in the absence of homozygous crests. 
Finally, [ shall be glad if I can assist the author and science by a few criticisms of some new 
points in his paper. 


Of the five criticisms it will be observed that four deal with definitions and one with the 
material used by me. The latter point may be considered first. The criticism on my original 
stock is not directed towards any concealment on my part, for the data of origin, so far as 
known, are given and one or more paintings of original stock published. It is rather directed 
towards the unfitness of races that are bred for song to give evidence concerning inheritance of 
colour and towards the fact that my birds were not “pure-bred.” I am somewhat embarrassed 
in replying to this criticism since the author does not define what he means by pure-bred. 
I know what notion I associate with “pure breeding,” namely, continual self-fertilization, such 
as occurs in sweet peas and many other plants, but obviously that kind of pure breeding 
is unknown among canaries. Some fanciers, | know, mean by pure breeding mating inside the 
same race or variety ; but this definition is hopelessly vague since there is no agreement as to 
the limits of relationship or similarity compatible with these terms. A few fanciers mean 
by “pure-bred” birds, birds having a characteristic a all of whose ancestors for several gene- 
rations have the same characteristic a. For such, the offspring of Crested Norwich x Crested 
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Manchester Coppy would be “ pure-bred.” This meaning of “pure-bred” is at least clear ; but 
its utility may be doubted ; nor does such “pure breeding” insure uniformity of progeny, since 
even if the crested parents have had only crested ancestors for many generations they may still 
throw plainheads. As a matter of fact, all philosophical breeders of experience realize that the 
fetish of “pure breeding” is a will-of-the-wisp, and for the most part a hopelessly obscure, and 
unsatisfactory conception. Practically the experienced breeder need not greatly care (within 
limits) about the ancestry of his birds. A little experience in breeding will give him greater 
assurance as to the character of the germ plasm than the affidavits of the fancier. To return to 
my stock, they were live, healthy and mostly fecund birds,—canary birds, no doubt. That is 
about as far as it was necessary to go. If I offered the additional information that they might 
be classified as “ Harz” or “ Norwich,” that might help a possible critic ; but I laid little stress 
on the statement and, from my experience, think it unimportant. I would defy anybody to 
classify my canaries now, but nevertheless I know more about their germ plasm than most 
punctilious fanciers do about that of their pure breeds. And a knowledge of performance 


of germ plasm, not capacity for satisfying the conditions of the “Standard,” is what I have 
worked for. 


As for my terms, it is the privilege of an author to use them as he sees fit provided he 
defines them clearly. Not having the fortune to live at Aberdeen and to mingle with Scotch 
fanciers, I have, no doubt, defined some terms differently from my critic. That was not only 
harmless but necessary. I specifically defined “imperfect” crest as I was about to use it 
for the immediate purpose of the Table and four paragraphs of Text (p. 11) as “more or less 
bald on the occiput.” I cannot find that my critic anywhere gives his definition of the term. 
He merely says (p. 2) “this bald spot accompanies most of the perfect crests of the present 
day.” Our use of the word “ perfect” in the two sentences differs—that is clear and sufficient. 


Next, my critic objects to my sense of the word “yellow.” I was not unaware of the use of 
“Yellow” by most English fanciers in a technical sense. _Thus Blakston says (p. 94), “we 
speak of a Yellow Green or a Buff Green, a Yellow Cinnamon or a Buff Cinnamon, when it is 
patent that neither of these colours can be yellow or buff in reality ; and the words, therefore, 
taken in their general application, are technicalities.” Now I was not writing for English 
fanciers merely, I was writing for men of science without regard to nationality, and to have 
used “Yellow” in the fanciers’ sense without a long explanation would have led to confusion. 
I preferred, in common with some other authors in English (and as clearly stated on page 14 of 
my book), to use Jonque and Mealy instead of “Yellow” and “ Buff.” That left me free to use 
yellow in what Blakston calls the real sense; the sense also in which the German fanciers, 
as exemplified by Karl Russ, use it when they speak of Hochgelbe and Strohgelbe. If I had 
intended to use yellow as a technical group-name I should certainly have printed it in capitals, 
as Blakston does, and as I print “ Buff Cochin.” As for the assertion that my definitions 
of “jonque” and “mealy” are inadequate, referring to colour merely, instead of including the 
form of the feathering and that of the whole body of the bird, I may say merely that not having 
been acquainted with my critic I could not have known his use of the terms ; that my definitions 
were based on and are practically identical with those of Blakston’s great English classic 
(p. 94). In so far as they do not include form of the body of the bird they are devoid of the 
probably fantastic associations of a special group of fanciers in the vicinity of Aberdeen. 
At least, I find in my hybrids no necessary correlation of jonque and mealy plumage colour 
with the form of the body of the bird. 


As for the emphatic denial (p. 4) of my statement that hybrids of the yellow canary 
“frequently show more or less of the canary yellow,” I need only ask any intelligent reader if the 
assertion is contradicted by my critic’s statement that “the great bulk” of such hybrids are 
dark. ‘ Frequently” implies less than half of them light; “the great bulk” more than half 
dark. I might willingly have added to my book the same words as my critic, except to avoid 


51—2 











400 Miscellanea 


tautology. Finally, my critic’s categorical statement (p. 32) that my “greens” and “yellows” 
are both variegated (in his sense) is as gratuitous as it is unscientific and false as a matter 
of fact. 


We may next consider the differences between Dr Galloway’s experimental results and mine. 


(a) That, as stated on pages 3, 4, my critic should have got 15 unmottled to 5 mottled* 
offspring from non-crested birds, where 1 : 1 was expected is no argument against Mendelian results, 
since the numbers are insufficient. In my breeding, single families showed such deviation from 
expectation, but the totals of 32:30 revealed the true significance of the mating. As to my 
critic’s divergent proportions with crested birds no conclusion can be drawn without a knowledge 
of his definition of “ variegated.” I suspect he includes in this category birds with pigmented 
crests ; I did not. There is an interesting correlation between crest and head pigment in 
my birds and so head pigment was disregarded in crested birds, and such as had no other 
pigment were’called “clears” or “yellows.” 


(b) Dr Galloway finds that crest x crest-bred plainhead gives an equal number of crested 
and crestless young. As I showed in my paper this result follows when plainheads of any 
origin are mated with heterozygous crests. He finds that in the offspring of two crested parents, 
crests are to plainheads as 2:1. This is a very unanalytical grouping. The results are 1:0 
or 3:1 according as parents are homozygous or heterozygous in crest. 


(ec) Dr Galloway finds no case of crest x plain that produces crests only. This is not 
strange, as at least twice as many heterozygous as homozygous crested are to be expected. 


Passing now to a criticism of other points in Dr Galloway’s paper one might first wish for a 
clearer table of matings and progeny, with individual numbers so that one could trace results in 
cases where the same bird is used several times in different matings. This information the 
appendix does not readily, if at all, afford. Secondly, one can only regret that, as so commonly 
the case with scientifically untrained writers, so much stress is laid on crude speculation, in this 
case as to the origin of domestic races of birds through a “cinnamon sport.” In its application 
to poultry, certainly, the cherished theory is not appealing. And why is the commonest of 
Hervieux’ canary colour types, translated literally “gray” instead of technically “green” (p. 13), 
said to be closely allied to the cinnamon canary (p. 18) ; and even given as a sort of synonym of 
the latter (p. 30) ? 


At the end of this polemic it is a pleasure to speak of the valuable data in this paper 
of Dr Galloway’s, the account of the inheritance in eye colour with its confirmation of the 
Bateson-Punnett theory and the conclusion that two Mealies cannot produce Jonques. With 
the general theory that the races of domestic animals have arisen by mutations the writer finds 
himself quite in accord with Dr Galloway. One can only wonder that, where the Mendelian 
theory is so often appreciated and supported, the author should occasionally fail to grasp 
it. And, best of all, it is a pleasure to see the increasing catholicity of the active editorship of 
Biometrika in accepting such a paper for publication t. 


* Mottled, as used by me, is not the equivalent of variegated as used by Blakston and, apparently, 
by Dr Galloway. It includes ‘‘ light variegated,” “ heavily variegated ” and ‘‘ marked.” 

+ [This statement of Dr Davenport’s needs an editorial comment. No paper dealing with heredity 
from the Mendelian standpoint has ever been refused by this journal, although such papers would 
be declined if they were considered inadequate experimentally or theoretically. Biometric papers 
British, American, and foreign have been rejected on these grounds. K,P.] 
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II. Canary Breeding. A Rejoinder to C. B. Davenport. 
By A. RUDOLF GALLOWAY. 


In writing my “Canary Breeding,” one of the chief objects I had in view, was to secure 
some sort of agreement between scientific men and fanciers ; for the attainment of such an end 
would undoubtedly result in an increase of knowledge, and would greatly benefit both parties. 


Dr Davenport’s frequent references to fanciers and their terms, and his acknowledgment of 
Blakston’s Great English Classic, compel me to judge his statements from the fancier’s point 
of view, as well as from that of the scientific man. 


From the scientific standpoint only, his statements might be more consistent, though in all 
probability valueless ; from the fancier’s point of view, which is pre-eminently that of his 
great authority Blakston, it is easy to show that his interpretation of terms is extremely 
faulty :— 

In his Rejoinder, he makes the startling admission that he has in his tables classed dark 
crested birds as “clears or yellows.” The very thought of this is enough to make Blakston turn 
in his grave, for the following is his (Blakston’s) definition of a clear bird :— 


“ And here we draw the line” (at dark underflue), “‘and define ‘Clear’ to mean not having 
the remotest tinge of dark colour in quill, flue, or feather; birds from which every trace of 
the green has been eliminated ; which have a clear, silky, snowy-white underflue and show 
by it that they have arrived at the goal” (Cassell’s Canaries and Cage Birds, p. 100). 

Such an interpretation of standard nomenclature is unwarranted, and indeed incompre- 
hensible in one who regards Blakston with reverence and authority. It is needless to enter 
further into the meaning of the term “ variegation,’ which Blakston so fully and clearly 
explains (pp. 95-101), for it is evident that Davenport’s interpretation of it is his own con- 
ception (as indeed he admits), and consequently his results cannot be accepted by the initiated. 


Another extract from the Rejoinder must be given in full :— 


“As for the assertion that my definitions of ‘jonque’ and ‘ mealy’ are inadequate, referring 
to colour merely, instead of including the form of the feathering and that of the whole body of 
the bird, I may say merely that not having been acquainted with my critic, I could not have 
known his use of the terms ; that my definitions were based on and are practically identical 
with those of Blakston’s great English classic (p. 94). In so far as they do not include form of 
the body of the bird, they are devoid of the probably fantastic associations of a special group of 
fanciers in the vicinity of Aberdeen.” 

Nevertheless, if Dr Davenport will turn to p. 101 of the Great English Classic (of which, he 
will be glad to hear, a new edition will shortly be published—the original one having served its 
time and generation *) he will read as follows :— 


“The feathering on the Buff birds is much denser than in the Jonques, the under-flue being 
very thick and long, as the fancier will be able to observe when he makes his first essay in 
washing. Altogether the Buff bird is built on a larger and stouter scale, and is, in every respect, 
the more lusty bird of the two.” 


It is hopeless for Dr Davenport to escape from his erroneous use of the word “ yellow,” for 
certainly Blakston never used it as he does, and if I am not greatly mistaken his appeal 


* Cassell’s Canaries and Cage Birds was published in London, Paris, New York and Melbourne, 
some thirty years ago. 
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to German fanciers is equally at fault, for ‘hochgelb’ and ‘ weissgelb’ mean undoubtedly jonque 
and mealy (yellow and buff). In support of this let me give a note received on Dec. 1st, 1908, 
from that well-known fancier and scientific writer C. L. W. Noorduijn* of Groningen :— 

“T mean by buff-yellow, light or whitish yellow (mealy) (weissgelb), in contra-distinction to 
clear or orange yellow (hochgelb).” 

But even if we allow Dr Davenport to have his use of the term yellow, the mere fact that 
his matings take no account of the rudimentary distinction between the jonques and mealies, is 
amply sufficient to seriously detract from their value. 


The following statements on Davenport’s pages 6 and 7 must also be corrected :— 

(1) That Hervieux in 1713 gives only 10 varieties (p. 7). 

(2) That during the first half of the eighteenth century, the number of colour varieties was 
greatly increased, since Hervieux, in his edition of 1766, recorded 29 colour varieties (p. 7). 

And (3) that the first edition of Hervieux was published in 1705 (p. 6). 

The facts are as given in my “Canary Breeding ” :— 

(1) That there is no 1705 edition--the first one appearing in 1709. 

(2) That in the 1709 edition, 28 varieties are mentioned. 


This list also disproves the conjecture “that the frizzled characteristic is probably relatively 
recent, as it is relatively uncommon” for it occurs among the 1709 varieties. 


It is perhaps superfluous to refer to the meaning of the word “ frequently” ; suffice it to say 


that the occurrence of 1 clear mule in 526 (see table on p. 4 of my “Canary Breeding”) cannot 
be included under that category. 


In the original draft of my paper, I found fault with Dr Davenport for translating the French 
word “gris” as green in his memoir (p. 7). This grumble was found wanting in the proof 
sheets, and I now refer to it only because in the Rejoinder the author finds fault with me for 
translating the same word “ literally gray” instead of “technically green.” The technicalities 
of science are worthy of reverence, but when they transform the colour grey into green, then 
there is something wrong, which should be corrected. 


In reply to a query in the Rejoinder, I may state that grey and cinnamon sports are stated 
to be closely allied by me, because of my experience in keeping and breeding them—as may be 
seen throughout my paper (e.g. greenfinch family, p. 32). 

With regard to the statement that I have not defined imperfect crest, it should not be 
necessary for me to refer to a whole paragraph on p. 2, where this is fully explained, nor to 
a special engraving on p. 24, which beautifully illustrates the modern crest of various degrees of 
perfection. 


My “unanalytical grouping” of the progeny of double-crests (2 crests to 1 crest-bred plain- 
head) is due to the fact that I could not assume the existence of homozygous crests which I had 
not found. 


Dr Davenport’s tirade against pure-breeding is extraordinary as coming from the pen of 
an ultra-Mendelian, and from a follower of the great Blakston, who devotes the whole of 
Chapter XIII. to the discussion and praise of Pedigree breeding. 


Let me again quote from the Rejoinder. “A few fanciers mean by ‘ pure-bred’ birds, birds 
having a characteristic @ all of whose ancestors for several generations have the same character- 


* Noorduijn discredits Davenport’s Mottling Theory, criticizes adversely his inferences from 
Hervieux and Russ, and completes the similarity to my own position by showing that the frilled 
canary is not ‘‘probably relatively recent, as it is relatively uncommon”; he likewise disagrees 
with Davenport’s crested experiments. (Kritische Besprechungen und Referate. Archiv fiir Rassen- 
und Gesellschafts-Biologie, Jahrg. 1909, 8S. 394.) 
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istic a. For such, the offspring of Crested Norwich x Crested Manchester Coppy* would be 
‘pure-bred.’” As the characteristic a must mean the crest and coppy (which however are 
very dissimilar) and as the rule in Crests and Coppies is to mate a crested bird with a crest- 


bred plainhead, a coppy with a coppy-bred plainhead, it is difficult to comprehend the sort of 
fancier to which reference is made. 


T must state, however, that our respective interpretations of the term “pure-bred” have 
nothing to do with the point at issue, which was my criticism of a result of Dr Davenport’s—on 
the ground that pure-bred birds had not been used—purporting to overturn one of Russ’, 


in which he (Russ) expressly stipulated that pure-bred birds were necessary. 


(See Davenport’s 
quotation, p. 15.) 


Concerning the “crude speculation” of my evolution theory, I sincerely hope that it will 
form the subject of serious discussion by competent authorities. I have many more facts 


in its support than could be produced in a paper which had to include such an amount of 
spade work. 


With one statement in the Rejoinder, I am in complete agreement with Dr Davenport, viz. 
“T would defy anybody to classify my canaries now.” He has succeeded in proving by his 
so-called scientific method, not only that a crestless bird is a perfect crest, but that it has 
a more perfect crest than three-fourths of the existing prize crested birds. By some ingenious 
allelomorphic scheme he might as easily prove that all bipeds were quadrupeds. From his false 
assumptions both in the matter of crest, and plumage-colour, it is quite impossible for him to 
arrive at any scientifically correct, or practically useful conclusion. 


III. Inheritance in Canaries: A Study in Mendelism. 
By DAVID HERON, M.A. 


In 1908 there appeared a papert on Inheritance in Canaries, by Chas. B. Davenport, 
Director of the Station for Experimental Evolution, Carnegie Institute of Washington, in which 
the author discusses the inheritance of form of crest and plumage-colour in canaries and con- 
cludes that those characters follow Mendelian rules. 


These results are accepted without criticism by Bateson{, and although Galloway§ has 
offered some criticism, it seems desirable that further attention should be called. to the methods 
used in this paper. 

It will not however be necessary to consider Davenport’s discussion of the inheritance of 
plumage-colour; his treatment of the inheritance of form of crest on pages 8 to 13 and the 
general list of matings in Table E, p. 24, so far as it deals with crest-form, wili be found quite 
sufficient for the present purpose, i.e. to measure the scientific weight of this paper. 


Before considering his results a strong protest must be made against the very defective 
condition in which they have been presented. From the material we can only arrive at a lower 
limit to the number of blunders in those few pages (in all 114 lines of text and four Tables) by 
noting those cases in which the tables are self-contradictory, e.g., when a bird is said to be 
crested in one table and non-crested in another. 


* A redundant term! 
+ Carnegie Institute of Washington Publication, No. 95. 
Evolution, No. 10. 


+ Mendel’s Principles of Heredity, pp. 37 and 43. 
§ Biometrika, Vol. vu. p. 1 et seq. 


Papers of the Station for Experimental 
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In Table E, on p. 24, the full list of matings is given. As it stands, it is far from satis- 
factory, and the list of errors in the statement of form of crest which can be detected from a 
study of this table alone, is a large one. Only 66 matings in all are given. 


In Experiments 501, 509, 616 and 723, a reference to Exp. 624 should be to Exp. 623. In 
Exp. 714, a reference to Exp. 625 should be to Exp. 624; Exp. 625 does not occur in this 
Table at all. In Exps. 716 and 725, a reference to Exp. 623 should be to Exp. 622. In Exp, 702, 
there is a mating with a bird which has no number at all. It is said to be a crested bird but 
there is no indication whether the crest is perfect or imperfect. Similarly no statement as to 
the presence or absence of baldness is made about Birds Nos. 50 (used twice), 269, and 240. 
3ird No. 110 used in Exp. 729, is not marked either for crest or baldness ; it comes from 
Exp. 604 but it does not occur in the columns in which crest and baldness are indicated although 
it is marked “Yellow” in the colour columns. To balance matters, however, Bird No. 192 in 
this Experiment (604) is given in the columns for crest but is dropped in the columns for colour, 
while Bird No. 112 is entered twice in the colour columns. In Exp. 703, Bird No. 67 is said to 
come from Exp. 10; it really comes from Exp. 505. In Exp. 703 also, three birds are said to 
have died young when considering crest and four when dealing with colour. In Exp. 503, Bird 
No. 40 comes from Exp. 405, not from 505 as stated. In Exp. 709, Bird No. 143 comes from 
Exp. 609, not from 607 as stated. In Exp. 405, Birds Nos. 39 and 40 are said to have “crest 
present”; in Exps. 503 and 511, No. 40 is said to have “crest absent,” while in Exp. 614, No. 39 
is said to have “crest absent.” As an example of an error of omission, there appears in 
Table II (Exp. 717), a mating between two birds which are not numbered ; this mating does not 
occur in Table E at all. 

Disregarding this serious list of blunders, however, we may now proceed to examine Tables I, 
II and III, and Davenport’s deductions from them. Since those tables can only have been 
made up from the General List of Matings, Table E, we shall assume that Table E, in so far as 
it is not self-contradictory, represents the facts of the case*. 

Dealing with presence or absence of crest, Davenport states (p. 8, 6 lines from bottom) that 
“crest is alternative in inheritance, for when crested and non-crested birds were paired, the 
offspring were either well crested or plain headed and there were no intergrades.” Davenport 
here uses “non-crested” as equivalent to “plain headed.” A few lines further on, however 
(p. 9, line 1), he says that Table I “gives an answer to the question whether non-crested heads 
are recessive to plain heads”; while in his conclusions on p. 23, he says that “crest is dominant 


over plain head.” In the second of those three quotations “plain heads” ought to read “crested 
heads.” 


Turning now to Table I, we find again many blunders here. 


For Exp. 514, read Exp. 509; for Exp. 613, read Exp. 614; for Exp. 614, read Exp. 615; for 
Exp. 615, read Exp. 616; for Exp. 623, read Exp. 622; for Exp. 624, read Exp. 623; for 
Exp. 711, read Exp. 712; for Bird No. 185 in Exp. 716, read Bird No. 186. 

Exp. 619 gave 12 non-crested birds, not 11 as stated. 


Further Bird No. 39 used in Exp. 614 (not Exp. 613 as stated) and said in Table I to have 
“crest absent,” is said in Table E, Exp. 405, to have “crest present” and in Table E, Exp. 614, 
to have “crest absent.” In Table I, however, the result of Exp. 405 is said to be two birds with 
“crest absent” and none with “crest present.” In Table E, the result of Exp. 405 is said to be 
two crested birds and none without crest. 


Similarly in Table E, Exp. 507 is said to give one bird with “crest present” (No. 71); in 


Table I however it is said to give one bird with “crest absent.” This bird No. 71 is not used 
again, so further light cannot be thrown on the matter. 


* Davenport states (p. 9, line 1) that Table I is “extracted from the general table.” 












































Miscellanea 405 


Again Exp. 726 in Table E, is said to be a mating of non-crested birds and thus ought to be 


included in Table I, while the father of Bird No. 142 used in Exp. 723 is crested, not non-crested 
as stated. 


Davenport states that out of 102°offspring of non-crested parents, all were non-crested. It 
is true that in this conclusion he is supported by Galloway but the point to be insisted on here 
is that this conclusion cannot be reached from his own data since, according to Table E, Birds 
Nos, 39, 40, and 71, were crested birds and came from non-crested parents. 


Turning now to Table II, we find again an extraordinary number of blunders. 


For Exp. 710a, read Exp. 711; for Exp. 713, read 714; for Exp. 515, read 513; for Exp. 625, 
read 624; for Exp. 712, read 713. 


Further, the results of Exps. 703, 604, 704 and 720, are not in accordance with the results of 
those experiments given in Table E. 


In Exp. 703, instead of the proportion of non-crested to crested, birds, 3:7, read 3:6; in 
Exp. 604, instead of 7: 1, read 7:2; in Exp. 704, instead of 0:7, read 0: 5; and in Exp. 720 
instead of 0: 1, read 0: 2. 


In Exp. 714 (not 713 as stated), Bird No. 240 is said to come from two crested parents. 
According to Table E, however, No. 240 is one of the “original stock of whose ancestry, con- 
sequently, nothing is known directly.” Similarly in Exps. 503 and 511, Bird No. 3 is said 
to come from a mating of crest and non-crest, while on p. 11 and in Table E it is said to be 
“original stock.” 

Davenport’s method of dealing with those tables is equally faulty. 


In part 1 of Table Il he deals with matings between crested birds. Now crested birds, on 
the assumption made by Davenport, that crest is dominant over absence of crest, may be either 
DD or DR, and if DD be mated with DD or DR all the offspring should be crested, while if DR 
be mated with DR, three-fourths of the offspring should be crested. 


It is important therefore that DD’s should be distinguished from DR’s. If any of the off- 
spring of a pair of crested birds are non-crested then both parents must be heterozygous ; but if 
all the offspring are crested, it by no means follows that one or both of the parents is homo- 
zygous. So long as only crested birds are produced we cannot distinguish between DD’s and 
DR’s. All we can do is to appeal to the laws of probability and estimate the chance that all the 
offspring of a pair of crested birds, which are really heterozygous, shall be crested. 

Now Davenport, as the author of a book on Statistical Methods, must be familiar with the 
various probabilities involved in those cases. He says that “at least two birds are homozygous 
in crest, No. 12 which has produced nine young all crested and No. 79 which has produced 
11 young all crested.” One is not surprised to find that according to Table E, No. 79 has pro- 
duced only nine young all crested and this correction must first be applied. Further he says 
that No. 126 which has produced four young all crested is “possibly homozygous” ; and that 
No. 9 has produced 16 crested birds and one non-crested. 

Let us consider the probabilities involved in those cases. What is the chance that a pair of 
DR’s, i.e. crested but heterozygous birds, will produce “n” young all crested? In the long run 
they will produce three crested birds to one non-crested bird, and thus the chances that 
n, n—1, n—2, etc. out of n will be crested are the successive terms of (3+1)" out of 4". 
Thus the chance that all out of nine shall be crested is (#)?=1 in 13. 

Now he actually gets in his experiments the proportion 16:1. What is the probability of 
such a combination arising? It is 

Ble ‘ 
i qua! in 23. 
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Also the combination three birds without crest out of three arises. The chance that this will 
arise is still more remote—1 in 64. 


Now Davenport rejects the combination 9: 0 which occurs once in 13 times while he finds 
among his material combinations whose chance of occurring is far more remote, and yet retains 
them ! 


Homozygous and heterozygous birds in such cases can only be distinguished by increasing 
the number of experiments, but in cases of doubt crested birds should be mated with non- 
crested birds as then the chance that all of nine for example shall be crested is only 
1 in 2°=1 in 512. To get the same probability when mating two crested birds, we must 
obtain 22 offspring. Davenport does not seem to recognise that whenever all the offspring are 
crested, whether there be only two or as many as nine, there is a possibility of one or both 
parents being homozygous. 


Davenport’s failure to test adequately the gametic constitution of the parents renders any 
appeal to his totals useless and a similar criticism applies to part 2 of the same table. All we 
can say is that in Table II, part 1, there are some matings which are DRxDR and others 
which may be DDx DD, DD x DR, or DRx DR, and that conclusions based on methods such as 
these are of no value. 


His treatment of the inheritance of baldness however is even more faulty, and in Table III, 
one is confronted at the outset with a series of blunders and mis-statements which are absolutely 
unparalleled. In Table III, there are given the results of 26 matings, and in the following 
cases the statements made are contradicted either in the same table or elsewhere in the paper. 


In Table A, Exp. 617, Bird No. 84 should be No. 82; and in Exp. 620, Bird No. 83 has a 
“perfect crest,” not “imperfect” as stated. Exp. 620 should therefore be included in Table C 
and not in Table A. In Table B, a, Exp. 505, Bird No. 34 is said to have a perfect crest ; 
in Exp. 513, C, 8, it is said to have an imperfect crest, while in Table E, Exp. 403, it is said to 
have a perfect crest. In Table B, a, Exp. 505, four of the offspring are said to be non-crested ; 
only two are given in Table E. In the same experiment (No. 505) Bird No. 7 is said to be with- 
out crest ; in Table C, 8, it is said to have a perfect crest, while in Tables I. and E it is said to 
be without crest. In Table B, a, Exp. 625 should be No. 624. 


In Table B, 8, among the offspring of Exp. 705, there are only three birds with perfect crest 
and not five as stated. In Exp. 714, in the same table, there is said to be one bird with im- 
perfect crest and six in all. In Table E, for this experiment there are four birds with perfect 
crest, one with crest absent, and two which died young. No bird is marked as having an 
imperfect crest. According to his own figures instead of totals of 11 perfect crests and four 
imperfect crests there are nine perfect crests and three imperfect crests together with seven 
birds which are without crests ; such birds are in the same table considered as perfect crested 
birds, i.e. they are not bald. There are thus 16 birds with perfect crests and three with 
imperfect crests. 


Further, no imperfect crested birds are known to have occurred in the ancestry of Birds 
Nos. 157 and 240, and to state as he does that such is “probably” the case is to beg the whole 
question. 


In Table C, a, Exp. 508, there should be one bird without crest. In Exp. 604, Bird No. 62 
has a “perfect crest,” not an “imperfect crest” as stated. This mating is therefore wrongly 
included in the Table. In Exps. 608 and 703, Bird No. 67 should have a “ perfect crest,” not 
an “imperfect crest” as stated. These matings should therefore be included in Table B and 
not in Table C. In Exp. 703, there are only two birds with perfect crest, not three as stated. 
In Exp. 704 there are only three birds with perfect crest, not five as stated. In Exp. 711 
there are two birds with perfect crest and none with imperfect crest, not one of each as stated. 
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In Table C, 8, in each of Exps. 513 and 621 (not 622 as stated) one bird without crest is 
dropped. 


The climax is however reached in dealing with two matings between Birds No. 6 and 9. 


In Table C, a, Exp. 510, this is said to be a mating of DRx RR; in Table C, 8, Exp. 504, to 
be a mating of DDxRR. In the first case it is said to give four imperfect crested birds and 
none with perfect crest; in the second case it is said to give two perfect crested birds and one 
bird without crest. In the first case No. 6 is said to have an imperfect crest and No. 9 a 
perfect crest; in the second case No. 6 is said to have a perfect crest and No. 9 an imperfect 
crest. On p. 11, line 32, No. 9 is said to have a perfect crest and No. 6 an imperfect crest. 
The extraordinary versatility of these two birds is decidedly embarrassing and one is somewhat 
surprised to find that the tables in which such blunders occur are headed “ Matings to Test 
Inheritance of Imperfect Crest.” Davenport makes a similar blunder in dealing with two 
matings between Birds Nos. 7 and 34. It has already been pointed out that No. 7 has at one 
time no crest and at another time a perfect crest, and that No. 34 has at one time a perfect crest 
and at another time an imperfect crest. So also the mating 7 x 34 in Exp. 505 is said to be one 
of DDxDD or DD xDR, and in Exp. 513 the same mating is said to be one of DD x RR! 


Again, Table B is divided into two sections a and 8. Section a deals with matings DDx DD 
and DDxDR, while section 8 deals with the mating DRxDR. But the distinction is purely 
arbitrary; as has already been explained if a mating of birds with perfect crest results in one 
bird with imperfect crest, then the mating must be DRxDR but further we cannot go. The 
same criticism applies to his division of Table C into two parts. 

Various conclusions from Table III are given on p. 13, e.g. “Table III shows that when 
two imperfect crested birds are mated, the offspring have imperfect crests (Table III, A). This 
indicates that absence of occipital feathering is recessive to its presence.” He gives those 
matings in Table III, A, but omits Exp. 403 which is a mating between Birds Nos. 21 and 13. 
These birds are said on p. 11 to have imperfect crests. The result of Exp. 403 is, one bird with 
crest absent and one with crest present and perfect (No. 34). It has already been pointed out 
that this bird is stated twice to have a perfect crest and once to have an imperfect crest. The 
balance of probability being in favour of its having a perfect crest, the conclusions drawn are 
not justified and are flatly contradicted by Davenport’s own material. Further, Exp. 620 
should be deleted from Table A and placed in Table C, since Bird No. 83 has a “perfect 
crest” and not an “imperfect crest” as stated. 


Again on p. 13 we find “when both parents have a perfect crest (being therefore DD or DR) 
all the offspring have the perfect crest (Table III, B, a).” This of course would only be the 
case for the matings DD x DD or DDx DR as stated in the Table, but the heading of Table III, B 
is “Neither parent has imperfect crest” and the table actually contains an incomplete list of 
matings in which at least one parent has a perfect crest and in which one parent has (in seven 
cases out of nine) no crest at all. These statements are all consistent with each other if non- 
crested birds are considered as having perfect crests, but Davenport excludes them in counting 
the offspring. It is quite justifiable for Davenport to group together birds with perfect crest 
and birds with no crest since neither according to him have the bald spot on the occiput, but, if 
he does so in considering the matings, he must also do so in counting the offspring, and this he 
fails to do. He actually leaves out of account altogether 39 birds out of 112. 


It is impossible to tell what he intended to include in this table; he gives two cases of 
mating of perfect crest with perfect crest and seven cases of perfect crest with absence of crest. 
But if this is what he means, the list is far from complete. Assuming that his statements as 
to perfect crest, imperfect crest or absence of crest as given in Table E are correct, the following 
experiments must be added, Nos. 402, 502, 513, 604, 606, 608, 611, 701, 703, 709, and perhaps 
514 and 702. 
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In those last two experiments it is not stated whether the crest is perfect or imperfect in 
the case of at least one of the parents, Birds Nos. 50, 269, and the mate of 269 which has no 
number. 

Of this list, Nos. 513, 604, 608 and 703 are given in Tables III, C, a and 8, but their inclu- 
sion is in flat contradiction to Table E. 

Table III, C, a, is stated on p. 13 to consist of matings between perfectly feathered individuals 
that are probably hybrids between dominants and recessives, i.e. DR’s and imperfectly feathered 
consorts. The heading of the Table is, however, “One Parent has Imperfect Crest.” According 
to Table E, however, Bird No. 34 has a perfect crest and Exp. 513 should thus be included in 
Table B. As before, however, “absence of crest” is considered as equivalent to “perfect crest” 
in dealing with the parents, but birds without crest are rejected in counting the offspring. 

In Table III, C the following should also be included: Exps. 511, 601, 620, 621 and perhaps 
514, 613, 702 and 714, but absence of information in Table E leaves the matter in doubt. 

The gametic scheme on p. 13 which assumes that in the crest there are two pairs of allelo- 
morphs is not justified. It required the presence of bald plainheads, and out of Davenport’s 
154 non-crested birds none have been found to be bald. Galloway also says (Biometrika, Vol. vit. 
p. 25) that no bald plainheads have occurred in his experience. 

The material used by Davenport is also unsatisfagtory. Of the birds used in those experi- 
ments some are said to have been of the “Norwich” type, and in Plate I, Fig. 2, in Davenport’s 
paper, is given an illustration of a crested Norwich canary. This is however so far removed from 
what is known as a Norwich crest that it seems desirable to consider the point further. I have 
examined carefully every Norwich canary exhibited at the 1910 exhibitions of the Scottish 
National Cage Bird Society at Edinburgh and of the London and Provincial Ornithological 
Society at the Crystal Palace, some hundreds in all, and saw no bird at all like that shown 
in Fig. 2. Such birds sometimes appear but they are rare and are certainly not representative of 
the class. 

There are three well-marked types of Norwich canaries: the Norwich plainhead, the crest- 
bred plainhead, and the Norwich crest. These three birds are quite distinct and cannot be 
confused by anyone who has ever seen them. The crest-bred plainhead and the Norwich crest 
are usually bred together, but the Norwich plainhead is never bred with the other two. Now it 
is not clear whether Davenport’s non-crested birds are plainheads or crest-bred plainheads or 
indeed whether he is aware of the distinction between these two types, but that they are quite 
distinct can be seen clearly from the illustrations given here. In Fig. 1, we have a typical 
Norwich plainhead. It is very neat in appearance and shows no trace of overhanging eyebrows 
and offers a strong contrast to Fig. 2 which is a crest-bred canary. In this bird the head feather 
is very dense, and when it is turned forward with the finger it reaches right over the tip of the 
beak. The illustration also shows the typically heavy eyebrows of this class. 

In Fig. 3 is given an illustration of a Norwich crest. The crest is very regular and radiates 
from a well-defined centre ; the eyes and beak are completely covered by the crest and it is clear 
that this bird has very little in common with the “Norwich crest” illustrated by Davenport. 

It is obviously of the highest importance that the existence of these three distinct types should 
be recognised. In respect of crest the crest-bred plainhead occupies a distinctly intermediate 
position and cannot be classed either with the Norwich crest or with the Norwich plainhead. 

It has thus been shown that every conclusion made by Davenport can be proved to be false 
from a study of his own material; that if a fact has to be stated twice the one statement is 
flatly opposed to the other and that blunder is heaped on blunder until patience is exhausted. 


Yet such work is accepted as showing that Mendelian rules apply to Canaries! (See Bateson, 
loc. cit. pp. 37 and 43.) 


Before the cautious man of science can be convinced of the truth of Mendelism, it will be 


necessary that in Mendelian writings a much higher standard of observational and statistical 
accuracy should be employed. 











Miscellanea 409 


TYPES OF NORWICH CANARIES. 








Fig. 1. Norwich Plainhead. 
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Fic. 2. Crest-bred Norwich Plainhead. 





Fic. 3. Norwich Crest. 
Reproduced by kind permission of the Proprietors of Canary and Cage Bird Life. 
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Abac for determining the Probable Errors of Correlation Coefficients. 


























». 








BIOMETRIC NOTES. 


I. An Abac for determining the Probable Errors of 
Correlation Coefficients. 


By DAVID HERON, M.A. 


THE probable errors of correlation coefficients are so often required that the accompanying 
abac which enables them to be determined at once will save considerable labour. From this 
abac the probable errors can be read off correct to at least two decimal places; greater 
accuracy is seldom required. 

The principle on which the abac is constructed is simple. If we take the well-known 
formula for the probable error of a correlation coefficient 

67449 (1 — 7°) 
, \ 
E,=— ; 


Jn 


where 7 is the correlation and x the number of observations, and express it in logarithmic 
form, we get 

log (#,.) = 1°8289755 + log (1 — 7°) — $ log n. 
For any constant value of 7 there is thus a linear relationship between the logarithms 
of £, and »n. Hence by plotting £, and x on logarithmic scales and ruling lines for a 
sufficient number of values of 7, we can find #, from 7 and x without difficulty. 

The use of the abac can best be illustrated by an example. Let the correlation between 
two variables be 55 and the number of observations 160; then to find the probable error 
of 7, we must read along the perpendicular line from the number 160 on the scale of frequency 
until it crosses the diagonal line representing a correlation of 55. The position of the 
point of intersection of these two lines is then read, by aid of the vertical lines, on the scale 
of probable errors and the value ‘037 so obtained. 

After the diagram had been drawn in pencil, the whole of the laborious work of ruling 
in the lines in ink and preparing the diagram for reproduction was undertaken by Miss H. 


‘ 


+. Jones, and I have to thank her most heartily for her careful work. 


II. On the Probable Error of a Partial Correlation Coefficient. 
By DAVID HERON, M.A. 


In a paper “On the Theory of Correlation for any Number of Variables, treated by a New 
System of Notation,”* G. Udny Yule has suggested that the probable error of a partial correla- 
tion coefficient, giving the correlation between two variables for a constant value of a third, is of 
the same form as the probable error of the direct correlation between any two variables, but his 
proof by reason of its very general nature is by no means easy to follow and it seems desirable 


* Proc. R.S., Vol. 79, pp. 182 et seq. 
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to investigate the probable error of this coefficient on the lines laid down by Pearson and Filon 
in their classic paper on the probable errors of frequency constants*, 

Let sp2 be the correlation between variables 1 and 2 for a constant value of 3. 

Then 


712 — 713723 
3P12>= = —,° 
Jl —113" V1 ae 


Taking logarithmic differentials, we find, 





d spp drys aArys (713712 — 723) 
= + 


Ars (723712 — 713) 





3P12 M2—-713723 (712 —713723) (1 —743) 





Squaring, summing and dividing both sides by x, the total number of observations, and 
substituting for the standard deviation of the errors in 79, 713, 723, and for the correlations of 
errors in those quantities the values given by Pearson and Filon in the paper already cited, we 
find after striking out the common factors (1 — 743”) (1 — 723”) 


2*spr2 (712-113 723)” (1 — 73") (1 — 723") - 
3p12" ; 
= (1 — 142? — 743? — 7297+ 2712713793) (1 — 728”) (723 — T12713) MieTi3 
+(1 = 713") (713 — 112723) M1223 — (713 — 712723) (723 — 712713) 713723} 
+ (1 — 943") (1 — 7793”) {(1 — 112”) + (713 — 712723)? + (723 — 112713)? — 2793 (1 — 7412”) (7723 — M2713) 
— 2 (1—719”) (713 — 712723) 713 + 2 (713 — 712723) (7237127 13) N12} 


3ut the second part of the expression to the right of the sign of equality reduces to 


(1 = 743") (1 — 723”) (1 — 719") (1 — 712° = 713" — 72g" + 2712713723) 


3° 23/9 


and the second factor of the first part reduces to 


L = 19? — 743" — 129? + 2712713723 — (L — 719") (1 — 743”) (1 — 1257), 


so that the whole expression on the right reduces at once to 

(1 — 719? — 7137 — 723" + 2712713723)". 
But 

("12 —113723)" 


(1 — 143") (1 — 12, ) 


9 
3sPi2z >= 





so that 
Lng hs 3 


Jn (1-13 


2+ Wry27 13 23 
2) (1 — 723”) 





23p12= 


= i- 3p12" 
/n 7 
and the probable error of sp,» is 


Jn 


_ 67449 (1 = p12") 


and is thus, as stated by Yule, of the same form as the probable error of the direct correlation 
between any two variables, 


* «On the Probable Errors of Frequency Constants and on the Influence of Random Selection on 
Variation and Correlation.” 


Vol. 191, pp. 229 et seq. 


By Karl Pearson, F.R.S., and L. N. G. Filon, B.A., Phil. Trans. R.S., A, 
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BOOK NOTES. 


Interpolationsrechnung, by T. N. TH1eELE. Leipzig. B. G. Teubner, 1909, M. 10. 


This interesting work which is divided into four chapters begins by evolving the general inter- 
polation formula 
X=A+(x-—a) [8 (a, b)+ (w—b) {8" (a, ..., ce) +(v—e) {8'" (a, ..., A+...}}] 
, , 
where o' (2, a)= a~s and 8” (z,..., 6)= Fae-te J etc. 
r-—a a—b 

This formula does not depend as Newton’s formula does on the tavulation of the function for 
even differences in the argument and in its practical application one differences the function in 
the usual way and then divides the differences so obtained by the corresponding difference in the 
argument. (Cp. De Morgan’s Differential and Integral Calculus, Ch. xvu1., on Interpolation and 
Summation.) The remainder of the first chapter deals with Newton’s Method and with series and 
the problems arising from them. The second chapter is devoted to symbolic work and although 
helpful and interesting in places it is rather overburdened with notation, and the third chapter 
gives auxiliary methods such as the graphic and explains the use of a “qualifizirte Differenz ” 
which is based on expressions formed symbolically, (Z-a)f(«)=f(#+1)—af(x) for the first 
difference and (£?—2cbE+b?) f (x) =f (a+2)—2cbf(w+1)+b?f(x) for the second difference and 
so on. These differences are applied in connection with exponential and periodic functions. 
The last chapter of the book deals with interpolation when there is more than one independent 
variable, but contains no new suggestions. 

Interpolation is perhaps chiefly serviceable in its arithmetical applications and we can recom- 
mend the present work not only for its examples on this part of the subject but also for its 
interesting theoretical treatment. 

WwW. 2 x 


The Theory of the Construction of Tables of Mortality. A course of Lectures by 
G. F. Harpy, F.I.A., Delivered during the Session 1904—5. Published 
for the Institute of Actuaries by C. and E. Layton, 1909. 


These Lectures which were intended mainly for Actuaries deal with graduation and the 
problems arising from it. They recount briefly the older methods of graduation such as the 
Graphic and Woolhouse’s Difference Method but are chiefly concerned with the fitting of curves 
and with the practical use of Makeham’s hypothesis for graduating a mortality Table. 

It is a pity that owing to the omission of those parts of the subject with which his audience 
was familiar and to the fact that Mr Hardy has not had time to enlarge what were notes for 
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a larger work the Lectures in their published form appear a little disjointed. This is unfortunate 
as it gives an impression of hurry which is enhanced by the fact that there is neither an index 
nor list of contents ! 


To biometricians Mr Hardy’s treatment of frequency curves will not we think pass without a 
certain amount of criticism. On p. 39 the differential equation from which the Pearson-type 
curves are evolved is given as 

1 dy ba — x? 
y dx ~a—batcx’ 


which does not seem to lead to Type I, while the curves have been renumbered simply according 
to range which seems unsatisfactory, as, if we are to give up a numbering to which we are 
accustomed, it is well to adopt one which will not call for revision. This could have been done 
either according to the number of terms in the denominator of the right-hand side of the funda- 
mental equation or according to the number of moments required, e.g. the curves requiring two 
moments numbered II a, II b, ete., and then further curves could be added if for any class of 
statistics a further term had to be included in the differential equation. 


There are one or two other details with which we find it hard to agree ; for example, on p. 46, 
Type III should we think have been used to fit the binomial series, while we find it hard to 
follow Mr Hardy’s statement that Types III and IV are not specially useful to Actuaries, the 
reason given is hardly convincing, and we have found both curves in not a few cases from actuarial 
data. 


To most people interested in graduation however the part of Mr Hardy’s book which will be 
most attractive will certainly be his account of his own graduation of the select tables (i.e. tables 
giving the effect of medical selection) based on the new British Offices experience. Prior to this 
graduation there had we believe been no published graduation of a select table by any curve- 
fitting process, and though the two diagrams (pp. 74 and 75) do not betoken a very close fit it 
must be borne in mind that it is on the sum of the two curves that the agreement really depends, 
and little fault can be found with the resulting graduation. But apart from this one cannot 
expect a perfect method of graduation to be reached immediately, and Mr Hardy has probably 
done far more in actuarial circles for scientific graduation than most actuaries realise. 

Our disagreement with some of the remarks made in Mr Hardy’s book does not obscure but 
rather tends to accentuate in our minds the value of many of his actuarial suggestions, while we 
hope his work will stimulate other Actuaries to a study of general curve-fitting, so that the old 
methods which have been used so freely in the past may be displaced by those which Mr Hardy 
more than any other actuary has brought before the notice of his profession. 


i oe 
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